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ABSTRACT 

We combine Herschel-PAC^ data from the PEP program with Spitzer 24 yum and 16 /vm photometry and 
ultra deep IRS mid-infrared spectra to measure the mid- to far-infrared spectral energy distribution (SED) of 
0.7 < z < 2.5 normal star forming galaxies around the main sequence (the redshift-dependent relation of star 
formation rate and stellar mass). Our very deep data confirm from individual far-infrared detections that z~2 
star formation rates are overestimated if based on 24 jim fluxes and SED templates that are calibrated via local 
trends with luminosity. Galaxies with similar ratios of rest-frame vLv(8) to 8-1000 yum infrared luminosity 
(LIR) tend to lie along lines of constant offset from the main sequence. We explore the relation between SED 
shape and offset in specific star formation rate (SSFR) from the redshift-dependent main sequence. Main se- 
quence galaxies tend to have a similar vLv(8)/LIR regardless of LIR and redshift, up to z~2.5, and vLy(8)/LIR 
decreases with increasing offset above the main sequence in a consistent way at the studied redshifts. We pro- 
vide a redshift-independent calibration of SED templates in the range of 8-60 /vm as a function of Alog(SSFR) 
offset from the main sequence. Redshift dependency enters only through the evolution of the main sequence 
with time. Ultra deep IRS spectra match these SED trends well and verify that they are mostly due to a change 
in ratio of PAH to LIR rather than continua of hidden AGN. Alternatively, we discuss the dependence of 
vLv(8)/LIR on LIR. Same yLv(8)/LIR is reached at increasingly higher LIR at higher redshift, with shifts rela- 
tive to local by 0.5 and 0.8 dex in log(LIR) at redshifts z~l and z~2. Corresponding SED template calibrations 
are provided for use if no stellar masses are in hand. For most of those z~2 star forming galaxies that also host 
an AGN, the mid-infrared is dominated by the star forming component. 

Subject headings: Galaxies: evolution - Galaxies: starburst - Galaxies: fundamental parameters - Cosmology: 
observations - Infrared: galaxies 



1. INTRODUCTION 

In any study of galaxy evolution, the star formation rate 
(SFR) is a key parameter SFRs can be estimated in various 
ways from different wavelengths, but for all reasonably mas- 
sive and dusty star forming galaxies, the far-infrared emission 
will dominate the emitted power and provide a reliable SFR 
estimator In star forming galaxies (SFG) most of the light 
which is emitted from young stars is absorbed by dust and 
re-emitted in the far-infrared, peaking around 60-100 //m rest 
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wavelength. Thus, a measurement or an estimate of the total 
infrared luminosity (LIR, integrated between rest frame 8 and 
1000 //m) of a galaxy is required in order to account for the 
reprocessed fraction of the light. 

The inaccessibility of the far-infrared to ground based in- 
struments and the technological challenges of far-infrared 
space missions have, until recently, left this critical part of 
the spectrum largely unexplored in high redshift galaxies. 
Other wavelengths have been used instead to estimate LIR 
and the mid-infrared, in particular observed by the Spitzer 
space telescope, has been important in detecting high redshift, 
star forming and dust obscured galaxies. The mid-infrared it- 
self however accounts for only a small fraction of the LIR 
and at increasing redshifts, the mid-infrared band covers even 
shorter rest-frame wavelengths, on the edge of the relevant 
8-1000 pm range. The derivation of LIR from mid-infrared 
observations requires large extrapolations that carry signifi- 
cant uncertainties. These extrapolations are typically based 
on families of SEDs that are parametrized as a function of LIR 
and calibrated to match the tren ds observed in the loc al uni- 
verse, in particular the library of ICharv & Elbazl(l2001l CEOl 
in the following). 

Indications that local infrared SEDs may not hold emerged 
with the first more detailed high-z studies. Luminous sub 
millimeter galaxies (SMGs) were i nferred to have lower 
far-infrared dust temperatures (e.g., IChapman et al.l |2005[ 
iPope et aT] [2006). and larger Polycyclic Aromatic Hydrocar- 
bon (PAH) equivalent width an d ratio to LIR (e.g..lLutz et alj 
,2005: Valianteet al. 2007: Meniiidez-Delmestre et al.ll2007F 
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iPope et alj 120081: iMurphv et aTl l2009h . when compared to 
local sources of equivalent LIR. Strong PAH emission 
was also found in some luminous 'dust obscured galax- 
ies' ( DOGs) selected by high 24 //m to opti cal flux ratios 
'e.g., Weedman et al."2006'; Sajina et al."2007'; 'Murp hy et all 



(e.i 
|20( 



2009). Both types of selections induce SED bias and stud- 
ies typically referred to luminous L1R>10'^^ sources. 
These limitations much less apply to studies of a small num- 
ber of lensed galaxies d Oesai et al. 2006; Ri gby et aTl 120081; 
ISiana et al.ll2008l l2009nFadely et al.i i2010). again typically 
indicating changing SEDs and stronger PAH than in local 
galaxies of sim ilar LIR. Using 70 a nd 160 fim stacking of 
24 jjm sources (Papovich et al."2007'), radio stacking of BzK 
galaxies (Daddietal. 2007a) and an SED study including 
IRS spectra of a sample of SMGs, DOGs, and AGN hosts 
(^urphy et al. 2009), evidence was then found that extrapo- 
lations from Spitzer 24 //m mid-infrared photometry tends to 
overestimate LIR for z~2 galaxies by factors of 2-10. This 
concerns a key epoch of galaxy evolution, during which both 
cosmic star formation and AGN activity peak, and which is 
thus at the focus of many current observational and theo- 
retical studies. Whether this relative 'mid-infrared excess' 
is due to continuum emission from dust heated by obscured 
AGNs that boost mid-infrared fluxes (Daddi et al. 2007b), or 
due to a mismatch between the real SEDs at high-z and the 
locally-calibrated templates that are used in converting the 
mid-infrared flux to LIR, remained unclear Stacking of X- 
ray data revealed a large fraction of obscured AGN in 'mid- 
infrared excess' sources (Da ddi et al.l 12007b). but then evi- 
dence from IRS spectroscopy pointed towards strong emis- 
sion from (PAH) not fully accounted for by the templates 
(IMurphv et al.ll2009l:lFadda et al.l l20T^. 

At z> 1 .5 the strong rest-frame 8 yum PAH complex is red- 
shifted into the 24 fj.m filter, which raised the suspicion that 
the local template SEDs have too low ratios of PAH to far- 
infrared emission, which in turn leads to inaccurate extrap- 
olation. This ratio varies significantly among local galaxies. 
Extrapolations from 8 /vm flux to LIR thus critically depend 
on selecting the correct SED template. 

The 8 jum PAH complex includes several broad emission 
bands. Its flux relative to the far-infrared depends strongly 
on th e local ISM conditions ( Laurent et al.ll2000l: ISales et al.l 
I2010h . In the local SED family of CEOl, flie ratio of 8/im 
PAH to LIR is fairly constant at LIR<10'° Lq where it 
is based mostly on normal star forming disks. At higher 
LIR where the ratio of the 8 yum luminosity to LIR de- 
creases, it is based on low-redshift luminous infrared galax- 
ies (LIRG, LIR>10" Lq) and ultra-luminous infrared galax- 
ies (ULIRG, LIR>10i2 Lo). Local (U)LIRGs have distinct 
properties that make them differ from an aggregation of nu- 
merous disk star forming regions. They are typically inter- 
acting or merging systems (Sanders & Mirabel 1996) with 
their luminous star form ation concentrated i n a small re- 
gion of size few 100 pc (ICondon et al.lll99lT) . They show 
high 'star formation efficiencies' SFR/ Mpas, lower CO lumi- 
nosity to g as mass conversion factor (ISolomon et al.l [T997t 
iDownes & Solomon 199 ^, and a 'deficit' of [CIII 158 
compared to LIR (e.g., Malh otra et al.l [T997I ; iLuhman et al.l 
120031; iGraci a-Carpio et al. 2 01l1) . All these properties re- 
flect a compact structure and intense radiation fields. The 
intense radiation fields naturally lead to lower PAH/LIR 
and a warmer far-infrared dust peak (e.g. iDale et aP 120011) . 
placing ULIRGs at the high temperature end of the local 



compactness-temperature relation (IChanial et al.ll2007h . 

All these well established properties and connotations of 
the local ULIRGs, beyond the basic definition of the class 
by LIR, do not necessarily apply for equally luminous z~2 
galaxies. High redshift galaxie s with UL IRG-like lumi- 
nosities are more gas rich (e.g., ITacconi et a l. 2010), can 
be di sk-like galaxies without indication of recent me rgers 
(e.g.. IShapiro et al.1 120081; iForster S chreiberet al."2009) and 
are not as compa c t as the local ULI RGs (Bouche et al. 200*^ 
iKriek e t al."2009': 'Forster Schreiber et al. 201 1 ). Along with 
this come arguments for more normal star formation efficien- 
cies dGenzel et alJj 2010; Daddi et al. 2010a ) and less of a 
[CII] deficit (e.g.. i MaioUno et al.. 2009; Gracia-Carpio et alj 
1201 Ih . Changes in star formation efficiencies and [CII] deficit 
reflect changing local ISM conditions in high-redshift galax- 
ies compared to local ones of the same luminosity, and chal- 
lenge the association of the absolute LIR with a single tem- 
plate SED shape. 

In recent years, a correlation has been established between 
the star f ormation rate (SFR) and stellar rnass of s tar forming 
galax ies dNoeske et al.ll2007l; iDaddi et al.|[2007al; lElbaz et alj 
l2007h . This is now commonly referred to as the 'main se- 
quence' of star forming galaxies, from which passive early 
type galaxies and local ULIRGs are outliers. The absolute 
scaling and possibly the slope of the specific star formation 
rate (SSFR, i.e. star formation rate per unit stellar mass) 
main sequence evolve with redshift. The SSFR significantly 
increa ses with redshift, scaling as (1 -i- z)", where 2.2 < 
n<4 (D addi et al. 2007a; Erb et al. 2006; Damen et al. 2009; 
|Dunne et al. 2009; Pannella et al. 2009: Bouche et al. 201C: 
Rodi ghiero et aLluOlOt lKarim et akHlbl l() . There is an ongo- 
ing debate on the slope /3 of the SSFRocMf main sequence of 
star forming galaxies. At z~l-2, much of the variation that is 
found may be related to how specifically star forming galaxies 
are separated from passive ones. BzK c olor selections seem 
to produce flatter slo pes near jS = -0.1 (IDaddi et alJl2"007al: 
IPanne lla etaTI 120091) compared to j3 ^ -0.4 ... - 0.5 when 
isolatin g SFGs in other ways from origin al mass selections 
(Rodighiero et al. 2010; Karim et alj201 ll) . see also the direct 
comparison in Karim et al. (201^! 

One of the implications of this fairly tight correlation be- 
tween SFR and mass is that star formation in these objects 
must be relatively continuous. This raises yet another reason 
to question the applicability of mid- to far-infrared flux ratios 
that were calibrated on local ULIRGs with strongly peaked 
star formation histories. The possibility arises that SSFR or 
its offset from the main sequence may more generally define 
the SED shape, rather than LIR alone. 

Wit h the adven t of the Herschel space telescope 
(Pilbra tt etaLll2010t) and its PACS instrument (Pog litsch eFal] 
2010), main sequence SFGs at redshifts of z~l-2 can now 
be detected directly in the 100 and 160 yum bands, reach- 
ing objects with star formation rates as low as 10s to a few 
100s Mq yr"' and typical masses above 10'" Mq. This is due 
to the Herschel surveys reaching down to very fairi t fluxes 
of order 1 mJy (iLutz et al.l [201 It iBerta et al.lf201 lb . New 
Herschel-PACS results confirmed and extended the finding 
that extrapolations from observed 24 yum wavelengths for 
z> 1.5 galaxies tend to overestimate LIR by factors of 4 to 
7 (iNordon et al.ll2010l; lElbaz et al.l[2010^ . This is in contrast 
to lower redshi fts where t he extrapolations are reasonably 
accurate (Elba z et al.ll2010l). Using d ata from the GOODS- 
Herschel program. lElbaz et all (1201 1() study the ratio of total 
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TABLE 1 

PACS PHOTOMETRY DEPTHS IN THE FIELDS USED IN THIS STUDY. 



3(T flux limits [mJy] 

Field 70 ^im 100 ^m 160 ^im 

GOODS-N ... 3.0 5.2 

GOODS-S 1.2 1.1 2.0 



IR luminosity to rest-frame 8 jum luminosity, which they find 
to be universal for most star forming galaxies, defining an IR 
main sequence in SFR vs. mass. The outliers (enhanced SFR) 
to this relation are found to be starbursting galaxies with high 
projected star formation surface densities. We compare our 
work to their results in the discussion. 

In this paper we study the relations between the mass, star 
formation rate (also LIR) and mid- to far-infrared SED prop- 
erties of distant star forming galaxies. We focus on two red- 
shift bins: 1.5 < z < 2.5 and 0.7 < z < 1.3 where the 
8 jjm PAH emission complex is covered by the Spitzer MIPS 
24 fim and IRS blue peak-up (16 /um) filters (MIPS24 and 
IRS 16 hereafter) respectively. Herschel-PACS provides the 
crucial LIR measurements and samples the far-infrared SED 
shape. We find that galaxies with constant vLv(8 fim) (where 
vLv= AnD^^^^^^.^vFy) to LIR emission ratios tend to lie in lines 
parallel to the main sequence (Section l3.1b . This motivates 
us to study the SED properties of galaxies as a function of 
their distance from the redshift-dependent main sequence, in 
addition to the traditional view as a function of IR luminos- 
ity. We adopt the redshift dependent main sequence from 
iRodighiero et al.l (120101) . which is based on Herschel-PACS 
data closely related to our sample. 

In Sections|2]and[3]we describe our sample, reconfirm from 
our deeper data the mid-infrared excess overprediction of z~2 
SFRs when extrapolated from 24 fj.m, and motivate the discus- 
sion of SEDs as a function of offset from the main sequence. 
In Section ID we describe the mean-SED redshift-scan fitting 
method. In Section |5] we study the mid- to far-infrared SEDs 
as a function of distance from the star forming main sequence, 
demonstrating that this provides a calibration without explicit 
redshift dependence. We then, in Section|6]compare IRS spec- 
tra with the modified templates. This also demonstrates that 
the change in mid-infrared SEDs is indeed due to scaling of 
the relative PAH strength and not due to continuum emission 
from hot dust. In Section|7]we alternatively present the SEDs 
in the more traditional way as a function of luminosity and de- 
rive a redshift-dependent calibration of vLv(8 jum) as a func- 
tion of LIR. This calibration allows the use of rest-frame 8 /vm 
photometry as a SFR indicator without added knowledge of 
the mass. In Section [8] we discuss mid- to far-infrared SEDs 
of X-ray AGN hosts in the light of the findings for inactive 
galaxies. 

We adopt a (D.„„D.^,Hq) = (0.3,0.7,70km s"' Mpc"') cos- 
mology throughout this paper A Chabrier (2003) initial mass 
function (IMF) is always assumed. Unless specified other- 
wise, we refer to the 'mid-infrared' as the rest frame wave- 
lengths corresponding to the MIPS 24 //m and IRS 16jum blue 
peakup filters and to the 'far-infrared' as the wavelengths cov- 
ered by the PACS 70-160 yum filters. For star forming galax- 
ies at redshift 0.7<z<2.5, these respectively correspond to a 
region dominated by PAH emission features, and by the rest 
frame far-infrared peak and its short wavelength slope. 

2. DATA AND SAMPLES 



TABLE 2 

Number and classification of sources in the z~ 1 and z~2 samples. The two 
numbers in each cell refer to the goods-s/goods-n fields. 







X-ray 


Power- 


Redshift 


SFG 


detected^ 


law^ 


0.7<z<1.3 


148/79 


21/26 


1/2 


1.5<z<2.5 


106/22 


16/10 


13/5 



' The X-ray detection and IRAC bands power-law tags are not mutually 
exclusive. 



TABLE 3 

Numbers and luminosity ranges of the full SFG samples and the SFG 

SAMPLES OBSERVED WITH IRS. AGNs ARE EXCLUDED. 



log(LIR8-1000;jm/Lo) 



redshifts 


Sample 


N 


Min 


Median 


Max 


0.7<z<1.3 


SFG 
SFG-IRS 


227 
9 


10.50 
11.15 


11.36 
11.59 


12.2 
11.92 


1.5<z<2.5 


SFG 
SFG-IRS 


128 
16 


11.34 
11.63 


12.0 
11.97 


12.87 
12.72 



Our far-infrared data are based on Herschel-PACS observa- 
tions in the GOODS fields. These were obtained as part of the 
guaranteed-time PACS Evolutionary Probe (PEFQ) project. 
For details on the observations field layout, and on data re- 
duction we refer to Lutz et al. (2011). The limiting fluxes 
are listed in Table [T] for a distributi on of s ource LIR versus 
redshift see Figure 12 of Lutz et al.| (1201 l l). The PACS 160, 
100 and 70 fj.m fluxes were extracted using sources from a 
Spitzer MIPS 24 Atm catalog as pri ors, following the method 
described in Magnel li et al.l (I2009h . The 24 pm images are 
very deep (^30 piy, 5 cr) and do not limit the completeness of 
the prior extraction, because only a very small fraction ( < 1 %) 
of PACS detectior is will be without a 24 counterp art at 
these depths (see iLutz et all 120111: iMagdis et ani201U for a 
more in-depth analysis). The prior extraction method natu- 
rally matches a 24 pm source to a PACS source. 

For our sample selection we require a redshift (either pho- 
tometric or spectroscopic) and a 160 pm detection, imply- 
ing also 24 pm detection. The total number of sources and 
their breakdown by redshift bins and SFG/AGN categories 
are summarized in Table |2] To complete the coverage of the 
mid-infrared at z~l we als o make use of Spitz er-IRS blue 
peak-up 16 pm catalogs by iTeplitz et al.l (1201 ll) . The limit- 
ing depths (3a-) in GOODS-N and GOODS-S are 40 and 65 
piy, which result in a high detection fraction of PACS sources. 
In the area covered by the 16 pm photometry, at redshifts 
0.7 < z < 1.3, 92% and 77% of the PACS sources in GOODS- 
N and GOODS-S fields respectively have 16 pm counterparts. 
12% and 6% of the PACS sources in these respective fields are 
outside the 16 pm coverage. 

Photometric redshift s in GOODS-N we re derived us- 
ing the code EAZY dBrammer et al.l 120081) applied on a 
PSF-matched mult i-wavelength catalog in UV (GALEX, 
Barger et al.l l2008t FMaitin et al. 2005 ), U band (KPNO 
4m/MOSAIC,'Bargeret al. 2008; CaDak"etiD|2^), optical 
(ACS, HST bviz, GiavaHsco et al. 2004), near-IR (FLAMIN- 
GOS, JHKFI) a nd K,-band from SUBARUAVIRCam 
(IWangetal.ll2010b . and mid-infrared (Spitzer IRAC and 

'http : //www . mpe . mpg . de/ir/Resea rch/PEP/| 
Kindly reduced by Kyoungsoo Lee 
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MIPS, Dickinson et al. in pre p.). Spectroscopic re dshifts in 
GOODS-N were adopted from lBarger et all (12008b . In com- 
parison to the Zspec, the Zphot in GOODS-N have a good ac- 
curacy with the following statistics for Aj=(Zp/,o, - Zspi,c)/(l 
+ '^specY median of -0.008 and median absolute deviation 
(MAD) of 0.033. The rate of Zphot critical failures, defined 
here as: A, > 0.5A/Im/ps24/24, i.e. a redshift deviation that 
produces a corresponding wavelength deviation greater than 
the half-width of the 24 yum filter is 7% . Spectroscopic red- 
shifts in GOODS-S were adopted from Bale stra et all (120101) 
matched to the multi-wavelen gth ( bvizJHK, IRAC, MIPS 
24 ^lm) MUSIC catalog (San tini et all 12009) which also in- 
cludes photometric redshifts for the remaining sources. The 
corresponding Zphot quality statistics for A^ are: median - 
0.0025, MAD 0.038, critical failures 10%. In total, 88% of 
the z~l and 35% of the z~2 samples have spectroscopic red- 
shifts. 

Masses were calculated from the above mentioned multi- 
wavelength catalogs_ofboflifields using the method de- 
scribed in iFontana et al.' ('2004'), with adjustments as de- 
scri bed inlS antini et al. (2009), see also i Fontana et al.l (l2006h 
and iGrazian et al.i (i2006i) . To ensure good mass estimates 
when discussing trends with SSFR, we require for those sam- 
ples a detection in all of the 3.6-5.8 /vm IRAC bands that 
cover the SED rest frame stellar bump (rest frame H-band). 
Only six 160 /im sources in our redshift bins do not match 
this criterion, these are included in Tables |2] and |3] but not 
used below when masses are required. We estimate the error 
on the derived masses by varying the input photometric fluxes 
according to their respective errors and find that the median 
error for the galaxies in our sample is 0.2 dex. While this 
uncertainty is non-negligible on individual objects, it plays 
a much reduced role in a statistics driven study such as the 
one presented here. In particular, the masses are used in mea- 
suring the galaxy SSFR and its difference from the main se- 
quence, that in its elf has a width of 0.3-0.4 dex in SSFR (e.g., 
lElbaz et al.ll2007h . 

Sources which h ave an X-ray counterpart in the Chan- 
dra 2 Ms catalogs ( [Alexander et al.1 120031: iLuo etaLllIOOSh 
have conservatively been flagged as AGNs. At a Umit of 
~ 2 X 1 0~^^ ergs cm^ a nd assuming the X-ray-SFR re- 
lation of iRanafli et al.l (l2003h . the minimum X-ray luminosity 
of z ~ 2 galaxies that can be detected is too bright for star 
formation except for extreme SFRs well above those typical 
for our sample. This is more of a consideration for the z~l 
sample where the galaxies with the highest SFRs can poten- 
tial ly produce e nough X-rays to be detected. When applying 
the lBauer et al.l (2004) classification based on X-ray luminos- 
ity and hardness ratio, 18% of the z~l AGN flagged sources 
may actually be X-ray detected, star forming galaxies. Even 
if 18% of the AGNs (3% of the fufl PACS sample) in the 
0.7<z<1.3 range are star formation dominated in the X-ray, 
the removal of these sources from the star forming galaxies 
sample does not have a significant influence. None of the z ~2 
X-ray sources in our sample i s suspected t o be s tar forma- 
tion dominated according to the lBauer et al.l (120041) classifica- 
tion, though some have very low X-ray phot on counts, which 
makes such a classification highly uncertain. [Alexander et al.l 
(p005) investigated luminous LIR~ IO'^Lq SMGs in the 
GOODS-N field. Among 20 SMGs, 15 are X-ray detected 
with AGN related X-rays, 3 are not X-ray detected, and only 
2 (10%) are X-ray detected with X-rays reported to be dom- 
inated by star formation. This suggests that at z~2 the prob- 
ability to erroneously flag these most luminous star forming 



galaxies as an AGN, solely based on an X-ray detection in 
the Chandra 2 Ms catalog is <10%. This probability drops 
sharply towards lower LIRs typical of our sample. Sources 
that show a monotonic rise in flux in the Spitzer-IRAC 3.6- 
8.0 fim bands are flagged as power-law AGNs. Unless ex- 
plicitly stated otherwise, the X-ray and power-law AGNs are 
excluded from the analysis. 

In this study we use the deep Spitzer-IRS spectra presented 
bv lFadda et al.l (120101) . This spectroscopic sample includes 22 
sources at 1.75 < z < 2.4 and 10 at 0.76 < z < 1.05 that are 
in the PEP GOODS-S field and have a 160 yum detection. 5 of 
the z~2 sources and 1 of the z~l sources have an associated 
X-ray source in the Chandra 2 Ms catalog. This sample se- 
lects faint 24 //m sources (0.14-0.5 mJy) and probes the com- 
mon rest-frame wavelength of 5-12 fim. Table [3] summarizes 
the numbers and luminosity ranges covered by the full SFG 
samples and the IRS spectroscopy sub samples. 

3. THE MID-INFRARED EXCESS: OVERPREDICTION OF Z~2 SFRS 
BY MID-INFRARED EXTRAPOLATION 

Spitzer-hased findings of overpredicted SFRs based on 24 
yum photometry at 1.5<z<2.5 (the 'mid-infrared excess' men- 
tioned in the introduction) were corroborated and extended 
by direct measurements of the total infrared luminosity in the 
far infrared in Nordon etal. (2010) and Elbaz et al. (2010). 
These measurements used GOODS-N imaging with the PACS 
and SPIRE instruments from the Herschel science demonstra- 
tion phase. While resolving many individual sources at these 
redshifts, they still relied on stacking techniques for a large 
fraction of the sources: 5% of PACS detections for the 24 jum 
selected sample in iNordon et al.l (120101) and could not probe 
luminosities smaller than 10^^ Lq. Our current GOODS-S 
data are about a factor of 3 deeper than the GOODS-N data 
used before (Tab. [U. The deeper data significantly increase 
the 160 jum detection fraction of 24 yum sources (30%), mak- 
ing stacks less necessary. It also allows us to probe down to 
log(LIR/LQ)*l 1.3, which we could not effectively reach be- 
fore. 

In Figure [T] using t he GO OD S-S data, we confirm the re- 
sults of Nordon et al. (120101) and lElbaz et aH (2010) (see also 
upper left panel in Figure [13] for a different representation). 
We plot LIR(160), the LIR as measured from PACS 160 yum, 
versus LIR(24) the LIR as ex trapolated from MIPS 24 pm flux 
using Chary & ElbazI ( 1200 l b SED Ubrary. MIPS sources un- 
detected by PACS are stacked in LIR(24) bins. Black triangles 
are the simple, number-weighted means of stacks and detec- 
tions in the LIR(24) range of each stack . All t he procedures 
are similar to those used in 'Nordon et al.l(l2010h . 

The 24 pva 'excess' is clearly evident in Fig[T] At the high 
luminosities the LIR(24) overestimation is by a factor of ~4, 
going down to ~2.5 towards log(LIR/LQ)«il 1 which we could 
not probe directly before. In the following sections we will 
demonstrate that this apparent excess is entirely a result of 
using the wrong SED templates in the 24 pm. flux to LIR con- 
version. 

3.1. What determines the mid-to-far IR SED? 

Traditionally, the use of mid-infrared photometry as a SFR 
indicator relied on empirical correlations between the mid- 
infrared spectral features and the total infrared luminosity. 
Since the mid-infrared itself accounts for only a small frac- 
tion of the 8-1000 pm LIR, such correlations are not trivial. If 
SED shape were strictly related to LIR and given the main se- 
quence correlation between stellar mass and LIR, one would 
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Fig. 1. — The 'mid-infrared excess' of 1.5 < z < 2.5 star forming galaxies in 
GOODS-S: LIR derived from PACS 160 /jm versus LIR derived from MIPS 
24 fim using the original CEOl templates. Circles indicate individual detec- 
tions. Squares indicate stacked 24 /im sources undetected by PACS, where 
the the number of stacked sources is displayed, the hoiizontal bars indicate 
the range of L1R(24) in the stack and the vertical are the error on the mean 
LIR(160). The number- weighted mean of stacks and detections is plotted in 
black triangles and thick black line. 

expect mass and mid- to far-infrared SED properties to cor- 
relate. Dependence of SED on other parameters is expected 
however, as mentioned in the introduction. As we will demon- 
strate in this section, there is a dependence of the vLv(8)/LIR 
ratio on both mass and LIR, such that galaxies of constant 
vLv(8)/LIR lie parallel to the main sequence on the SSFR ver- 
sus mass diagram. 

Out of our full sample, we consider in this section only the 
galaxies with a well determined stellar mass and a redshift 
range of 1.8<z<2.3 (narrower than the full sample). At this 
redshift range the 7.7 fim PAH complex is inside the MIPS24 
filter and the 10 jjm silicate absorption only affects the filter 
fluxes to a limited degree. Here and consistently throughout 
the paper, when quoting vLv(8)/LIR we refer to the value that 
would be measured with a MIPS24 filter for a z=2.0 galaxy of 
the given SED shape. We correct the observed MIPS 24 fim 
vLv(24/l-fz) to 8 fim rest-frame fluxes by finding the CEOl 
template which best fits the vLv(24/l-i-z)/LIR of the filter and 
the galaxy redshift, then redshifting this template to z=2 and 
extracting vLv,(8)/LIR for the MIPS24 filter. For the subsam- 
ple used in this section the extrapolation is over a short rest 
wavelength interval and the associated uncertainties are neg- 
ligible. 

Figure |2] plots this subsample on the SSFR vs. M* dia- 
gram. Red and blue colors indicate galaxies with low <-0.95 
and high >-0.95 vLv(8)/LIR ratios, which splits the sample 
roughly in half. While the separation is not clean, it is clear 
that sources with low vLv(8)/LIR tend to have higher SSFR at 
any given mass. Probing by eye the somewhat blurred border 
between the blue and red points, it appears to be sloped rather 
than at a horizontal line of constant SSFR. Lines of constant 
LIR appear in this plot at a slope of -1 (gray dotted lines). The 
blue/red separation is not as steep as these lines of constant 
LIR. The main sequence from Rodighiero et al. (2010), which 
is based on Herschel-PACS GOODS-N data is plotted as a 
dashed line and its intermediate slope of — 0.5 appears to be 
approximately parallel to the blue/red (low/high vLv(8)/LIR) 
separation line. In this interpretation, galaxies with constant 
vLv(8)/LIR thus appear along this mass-dependent SSFR(Mt) 
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Fig. 2. — SSFR versus mass for 1.8<z<2.3 galaxies. Galaxies with 
log(vL^(8)/LIR)<-0.95 are in red and galaxies with log(yL,,(8)/LIR)>-0.95 
are in blue. The dashed line is the main sequence of Rodighiero et al. (2010) 
with a slope of -0.5. Gray dotted hnes with a slope of -1 indicate lines of con- 
stant LIR. The gray hatched area is excluded by the PACS 1 60 /jm detection 
limit for the GOODS-S sample. 

of the main sequence, rather than lines of constant SSFR or 
constant LIR. While lines of vLv(8)/LIR seem to be closest to 
the slope of the main sequence, in practice over this limited 
mass range and the scatter in vLy(8)/LIR, binning the galaxies 
with respect to the main sequence (slope -0.5), constant SSFR 
lines (0 slope), or constant LIR lines (-1 slope) will not pro- 
duce drastically different selections in terms of vLv(8)/LIR. 

We quantify the latter statement via a Spearman rank 
correlation test of vLv(8)/LIR against LIR, SSFR and the 
offset from the main sequence at the given galaxy mass 
Alog(SSFR)MS = log(SSFR) - log(SSFR)MS- The results are 
plotted in Figure |3] All three tests reject the null hypothesis 
of uncorrelated data with high significance, but the best corre- 
lation coefficient and highest significance of the three is with 
the distance from the main sequence. We note that in case of a 
perfect correlation of vLv(8)/LIR and Alog(SSFR)MS, signif- 
icant correlation would also be expected with LIR and SSFR 
over the range studied, given the definitions of these quanti- 
ties. 

We thus found a tendency of galaxies with similar SEDs 
to lie along sloped lines in the SSFR versus mass diagram. 
This tendency corresponds to the best correlation found be- 
tween vLv(8)/LIR and Alog(SSFR)MS- It is consistent with 
the Rodighiero et al. (2010) main sequence slope, within the 
statistics of this small sample. The slope of these lines as 
opposed to a constant SSFR makes also the correlation of 
vLv(8)/LIR with LIR significant, even though it is not the best 
relation of the three. The physical origin of the correlation 
between SED shape and LIR hence seems more indirect, not 
fundamental as we shall discuss further in Section |9] Still, 
for practical purposes this correlation can be used to measure 
LIR from 24 fim fluxes in cases where the mass is not well 
determined, or unknown. 

In the following sections we will explore the mid-to-far IR 
SEDs in two ways: in the distance from the main sequence 
that seems to be the more fundamental relation, and the clas- 
sical and simpler dependence on LIR. 

4. METHOD: MEAN SED REDSHIFT SCAN 

The study of galaxy SEDs in the mid-to-far IR range can be 
broken into two parts; the shape of the SED and the overall 
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amplitude, i.e. the total infrared luminosity (LIR), defined as 
the total integrated luminosity between 8-1000 yum. 

For studying the SED shape, one must overcome a few dif- 
ficulties. The first is that due to various reasons, many of 
the galaxies are not detected at all wavelengths. The non- 
detections must be taken into account in order not to bias the 
results. The second is that at different redshifts the filters 
sample different rest-frame wavelengths. Normally this re- 
quires some inter/extrapolation to common wavelengths, that 
require assumptions on the same region of the SED we wish 
to study. This is most problematic in the mid-infrared where 
there are strong spectral variations on scales smaller than the 
filter width. 

We wish to study the mean properties of mid-to-far IR emis- 
sion. Therefore, instead of fitting a template SED to every 
individual galaxy, we take a slightly different approach: we 
fit a template to the combined photometric data points of a 
galaxy population in their rest-frame. This treats the com- 
bined data as if originating from a single mean galaxy, ob- 
served by many different filters which are slightly shifted in 
their central wavelengths. In this method, sampling a nar- 
row, yet non-negligible redshift interval requires no interpola- 
tion/extrapolation to a common rest frame-wavelength. Such 
extrapolations can be problematic in the mid-infrared due to 
PAH features. It also does not force the SEDs to cross at a 
single wavelength chosen for the normalization and the sam- 
pling of slightly different rest-frame wavelengths by the filters 
improves the effective resolution. 

The first and most important piece of information we need 
for each galaxy is its LIR. Herschel-PACS, especially with 
the 160 fim filter, allows us to accurately determine the LIR 
almost independently of the SED shape. Generally, in our 
fields and at the redshifts of interest, out of the 3 PACS bands, 
160 i-im has the highest detection rate. It also samples close to 
60 fim rest-frame at z~2, wh ich leads to a very robust LIR 
determination as lElbaz et all (2010) demonstrated. 60 jj.m 
rest-frame is a 'sweet spot' for monochromatic LIR determi- 
nation and we expand on this point further in Appendix |A] 
For the 1.5<z<2.5 sample we measure LIR by fitting CEOl 
templates to 160 /^m flux alone, while preserving the library 
LIR-template relation (monochromatic fitting), even in cases 
where shorter wavelength fluxes are available. At 0.7<z<1.3 
the PACS 160 fj.m filter has moved farther away from the rest- 
frame 60 fim sweet-spot and PACS 100 fj.m filter has moved 
closer For galaxies at these lower redshifts we fit the best 
combination of CEOl template shape and scale (two free pa- 



rameters) to the 100 and 160 jum points simultaneously, ex- 
cept for a few cases at the lowest luminosities where only 160 
fj.m is available. In the latter case a monochromatic fit to LIR 
is used. For further discussion on the LIR determination we 
refer the reader to Appendix lAl 

After determining the LIR of each galaxy in our full sam- 
ple, we are able to bin the sample by SSFR offset from the 
main sequence or by LIR. For each bin subsample we fit a sin- 
gle template to the vLy/LIR values of all sources combined. 
Dividing all filter luminosities of a given galaxy by its own 
LIR serves as normalization and does not affect the flux ra- 
tios. Each galaxy thus contributes nearly equally to the over- 
all vLv/LIR template fit, only depending on its photometry er- 
rors, but independent of its absolute luminosity. As a working 
example, we start by selecting 1.5 < z < 2.5 PACS 160 fim 
sources according to their Alog(SSFR)MS distance from the 
main sequence in the SSFR- M, plane . LIR i s converted to 
SFR using the relation from K ennicuti (Il998al) . converted to 
a Chabrier IMF (.Chabrier.2003i : 

(^).1.09xl0-(^) (1) 

The exact parametrization of the main sequence of star 
forming galaxies is somewhat ambiguous. Different studies 
(at different wavelengths and using different selections of star 
forming galaxies) find different slopes n and a corresponding 
scaling Q.ms for the relation: 

In this study we use th e main sequence as determined by 
iRodighiero et al. l (2010', corrected to a Chabrier IMF) that 
was derived from PEP data in GOODS-N: (n,log(QMs)) = 
(-0.496, 5.243) for z~2 and («,log(fiMs)) = (-0.394,3.945) 
for z~l. The redshift of 1 is between two bins in 
IRodighiero et al.l (120101) and we adopt the simple mean of the 
main-sequence above and below it. 

We divide our 1.5 < z < 2.5 sample into 6 bins as illus- 
trated in Figure |4]top panel. The bins are numbered 1-6 with 
increasing A log(SSFR)MS- Also indicated is the region below 
a line with slope -1 that is excluded by the 160 //m flux limit. 
The combined photometry for each Alog(SSFR)MS bin is plot- 
ted in Figure |5] top. Each filter flux is converted to vLy/LIR 
units according to the corresponding galaxy LIR and the data 
points are color coded according to their filter. Stacks of (indi- 
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vidually) undetected 100 and 70 yum (but detected at 160 jum) 
sources are also plotted in the respective filter color. On top 
of each stack point we indicate the number of sources in the 
stack and the error bars indicate the uncertainty on the mean 
vLy/LIR in the stack. For details about the stacking procedure 
see AppendixlBl 

An SED template is then fitted to the combined photometry 
of all galaxies in each bin using minimization. Since we are 
not making repeated measurements of the same galaxy, but 
rather of a galaxy population, the intrinsic scatter in the pop- 
ulation must be taken into account when calculating This 
has a significant eff'ect on the relative weight of each photo- 
metric point and the relative weight of the stacks. More details 
about the minimization are available in Appendix |C] We 
would like to keep the description of the SED shapes as sim- 
ple as possible. Due to its popularity, its use in determining 
LIR and the simple way in w hich it is defi ned, we choose to 
use the CEOl Ubrary. MagnelU et_al.l (!2009l) also found that at 
z~l this library reproduce the 24/70 micron colors better than 
a few other popular libraries. Even though we concentrate on 
CEOl, we will supply the means to calibrate any other library. 
The templates in the CEOl library represent the mid-infrared 
to sub millimeter SEDs of local galaxies according to their to- 
tal IR luminosity, and are uniquely identified by their LIR. We 
also use them below as SED shape templates that are encod- 
ing relative ratios between the far- and the mid-infrared, and 
the PAH features in particular, without enforcing the locally 
calibrated link to total IR luminosity. To avoid confusion, we 
refer in this case to the template identifier as Aceoi in units of 
Lq, irrespective of the IR luminosity to which it might have 
been rescaled. 

We thus search the CEOl library for the best fitting tem- 
plate shape. Since both data and templates are in vLy/LIR 
units, there is no scaling involved in this fit. We split at 15 /im 
rest-frame and fit the mid- and far-infrared independently. 
This split differentiates two distinct wavelength and physi- 
cal regions: The mid-infrared is dominated by PAH emission 
from cold dust regions, silicate absorption and possible con- 
tinuum from AGN-heated hot dust, while the far-infrared peak 
is dominated by graybody emission of dust. In the middle, in 
the range of 15-60 /im (probed by PACS), a tail distribution of 
hotter dust temperatures and t ransient heatin g of small grains 
shape the SED slope (e.g. De sert et al .1119901) . 

In each panel of Figure |5] we indicate the best fitting mid- 
and far-infrared Ac£oi templates. The best fitted SED is plot- 
ted as a solid black line. Broadband filter fluxes represent 
the weighted emission from a significant wavelength interval. 
For each filter we indicate with a solid colored line the ex- 
pected filter flux for the fitted template by gradually redshift- 
ing the fitted template in the range 1.5<z<2.5 and applying 
the filter transmission to the template. The central wavelength 
of the filter scans a corresponding range of rest-frame wave- 
lengths (redshift-scan). The distinction between the template 
and the curve of expected filter flux is most important in the 
mid-infrared region. 

One final note should be made: Since we require PACS 
160 jjm detections in order to estimate LIR, sources unde- 
tected in this filter are excluded. We have argued above and 
in Appendix IaI that 160 /im is a good proxy for LIR at these 
redshifts over a wide range of SEDs. When later binning by 
LIR (Section |7ll, non-detections at 160 /im thus do not in- 
troduce significant biases as we operate in narrow LIR bins: 
the near independence of rest frame vLv(60 //m)/LIR from 
SED shape means that non-detections are mostly due to Um- 
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Fig. 4. — Top: The binning of the 1.5 < z < 2.5 sample according to the 
distance from the main sequence. Bottom: Similar to above for the 0.7 < z< 
1.3 sample. 

iting luminosity-distance for a given LIR. When binning by 
Alog(SSFR)MS, this translates into a more complex combi- 
nation of limiting luminosity-distance and M». Unless there 
is an added dependence of SED shape on the absolute mass, 
which is not a result of the SFR-M, relation, this binning does 
not introduce biases as well. 

5. THE MID-TO-FAR IR SED AS A FUNCTION OF DISTANCE FROM 
THE MAIN SEQUENCE 



In Section 13.11 we found that galaxies with similar 
vLv(8)/LIR tend to lie parallel to the main sequence in their 
SSFR, and Section |4] described our method to build up mean 
SEDs for z~2 galaxies as a function of offset from the main 
sequence. We follow a similar procedure for our 0.7 < z < 1 .3 
sample. At z~l the MIPS24 filter samples rest-frame wave- 
lengths A ~ 12 /im, which cover entirely diff'erent spectral fea- 
tures than those at 8 fxm. We therefore add the 16 /im IRS Blue 
Peak-up filter data from i Teplitz et al.l (1201 ll) that samples 8 
/im rest-frame. The sample is divided into 6 Alog(SSFR)MS 
bins as illustrated in Figure |4] bottom panel and SEDs are 
again fitted to the combined photometry of all galaxies in each 
bin. The fit results are plotted in Figure |5]bottom. 

For both z~2 and z~l (Figure |5] top and bottom respec- 
tively), in the sequence of increasing bin number and SSFR, 
there is a clear trend towards fitting the mid-infrared ratio by 
template shapes corresponding to higher luminosities Ac£oi, 
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Fig. 5. — Top.' Mean SEDs fitted to the 1.5 < z < 2.5 sample, binned by the Alog(SSFR)MS distance from the main sequence hne. The number on the comer 
of each panel indicate the corresponding Alog(SSFR)MS bin in Figure|4]top panel. Circles and triangles are for GOODS-S and GOODS-N sources respectively. 
Squares mark stacks of GOODS-S undetected sources with the number of stacked sources indicated on them. The colors: red, green, blue and purple, correspond 
to the filter wavelengths: PACS 160, 100, 70 and MIPS 24 /jm. CEOl library SED template shapes vLy/LlR are fitted to the sample separately for the mid- and 
far-infrared rest wavelength ranges below and above 15 /jm. These fits are plotted in solid black and the best fitting SED shapes are labeled by their nominal 
luminosity A. The log(A) template identifier of the best fitting templates are noted separately for the mid- and far-infrared. The expected filter fluxes for the 
best-fit template are plotted as thick lines matching the filter color Bottom: Same as above for the 0.7 < z. < 1.3 sample. The additional 16 fim photometry is 
colored in cyan. The panel numbers match the bins plotted in Figure |4]bottom panel. 
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implying lower log(vLy{^)/LIR). In the next section ( 15.1b we 
will quantify these effects. 

In contrast, the far-infrared /I > 15 fj.m is fitted by similar 
templates in all six bins of the z~2 sample. In the CEOl li- 
brary all templates in the range 1 1 <log(A/L0)<12. 1 are nearly 
identical in the rest frame wavelengths covered by the 3 PACS 
bands (20-60 fj.m, see also Figure [14] in Appendix |A|i when 
plotted in vLy(A)/LlR scale. This part of the far-infrared SED, 
among this range of templates, is fairly insensitive to the ex- 
act location of the SED peak. For the z~2 sample we hence 
cannot reliably infer dust temperatures and which template 
will be the most accurate in extrapolating to longer rest-frame 
wavelengths. We note, however, that the templates fitted to 
the mid-infrared (/I < 15 fim) also fit the longer wavelengths 
up to 60 jum at z~2. For the z~ 1 sample, the best-fit templates 
for the far-infrared up to ~90 /im rest-frame are close to the 
templates fitted to the mid-infrared, as indicated by their re- 
spective AcEoi . Within the degeneracy of the templates we 
thus find consistent fit results for the mid- and far-infrared, 
but do not strongly constrain the shape of the rest far-infrared 
peak given the rest wavelength coverage of the PACS data. 

5.1. Building a SSFR-scaled SED library 

We wish to derive a recipe for calibrating the CEOl SEDs 
by offset from the main sequence. Figure |6] concentrates all 
the essential mid-infrared information from the template fit- 
tings described above into one figure and presents our main 
result. It plots the vLv(8 jt/m)/LIR versus the Alog(SSFR)Ms 
deviation from the main sequence. For each template fitted to 
the mid-infrared photometry in Figure |5] we derive the mean 
vLv(8)/LIR as observed through the MIPS 24 /jm filter at z=2 
(centered on 8 fim rest frame). This scaling curve can be used 



to recalibrate any SED library (AppendixiDt. Filter convolved 
flux is used instead of the monochromatic template point at 
8 jum because our template calibration is photometric, in a 
structured region of the SED. We cannot be sure that the tem- 
plate is correct in its higher resolution details, which also may 
vary in other libraries, but note good agreement of PAH dom- 
inated CEOl template shapes with IRS spectra for a subset of 
our sample (Section |6]l. For z~l, even though the 8 jum was 
observed by the 16 fim filter, we again specify vLv(8)/LIR in 
terms of the z=2.0 MIPS 24 /urn flux for the template. 

The resulting curves of vLy(8)/LIR as a function of 
Alog(SSFR)MS for the two redshifts bins are plotted in 
Figure |6] The slopes of the z~2 and z~l samples for 
Alog(SSFR)MS S are practically identical and the curves 
agree within the data uncertainty (plotted error bars), and in 
particular the added systematic ambiguity of placement of the 
main sequence itsel f at different redshifts. For our adopted 
main sequence from lRodighiero et al.l (120101) we estimate this 
placement to carry an uncertainty of ~0. 1-0. 15 dex, also con- 
sidering slightly different methods to derive LIR. This main 
sequence ambiguity results in horizontal shifts of the curves 
and can account for the Alog(SSFR)MS shift between the z ~1 
and z~2 curves. 

Galaxies which are on the main sequence at the two redshift 
bins have nearly the same yLv(8 /vm)/LIR ratio, and this ratio 
changes with the distance from the main sequence in a similar 
fashion. More importantly, it means that the increase with 
redshift of the SSFR at a given mass (i.e. the scaling of the 
main sequence with redshift), is the same as the scaling that 
affects SED shapes. 

The clear negative slope indicates a suppression of 8 jum 
emission vs. the LIR for galaxies with SSFR higher than the 
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main sequence. Quantitative details of this result are sensitive 
to the slope and in particular scaling of the adopted main se- 
quence which vary between different studies. Adopting a dif- 
ferent main sequence will change the absolute calibration de- 
rived here, but not the fundamental behavior, namely the satu- 
ration of log(yLv(8)/LlR)=-0.58 on/below the main sequence. 
It will hardly change the slope of the curves in Figure |6] 

For objects with SSFR below the main sequence, which 
are available only in the z~l sample, log(vLy(8)/LlR) satu- 
rates into a constant value of -0.58+0.04. This is the typ- 
ical (near constant) value for templates in the CEOl library 
for local galaxies of log(LlR/L0)<lO.2. In the CEOl library 
log(vLy(8)/LlR) was constrained by comparing ISOCAM- 
LW2 6.7 yum (also covering the 7.7 ^ m PAH band) with th e 
total infrared luminosity (Figure 3 in iCharv & Elbazll200Tl) . 
The correlation is almost linear up to log(LIR/Lo)~10 and 
then breaks towards progressively reduced vLv(6.7)/LIR with 
increasing LIR. A characterization of log(vLv(8)/LlR) as a 
function of main sequence offset Alog(SSFR)MS for local 
galaxies is beyond the scope of this work, but we can safely 
assume that the normal galaxies below the log(LlR/L0)=slO 
break point for vLv(8)/LlR in CEOl are main sequence star 
formers. We use their log(vLv(8)/LlR) to indicate the location 
of the z=0 main sequence in Figure |6] (green square). The 
break-point luminosity translates into a SFR of ~1 Mq yr ', 
which at z=0 means ~ 1 0'° Mq main se quence galaxies 
dBrinchmann et all 120041: iPeng et al.1 |2010^. Due to the 
specifics of the CEOl sample, higher luminosities are mostly 
sampled by very luminous z=0 ULIRG mergers for which 
8 ^im emission is suppre ssed. With masses of a few lO'^Mg 
(e.g.. lDasvra et al.ll2006l) these ULlRGs will be clearly above 
the local main sequence. This implies a downward trend in 
Figure |6] at pos itive SSFR offs ets above the main sequence, 
also for z=0. Elbaz et aP (1201 1. their Equation 5) find con- 
sistent values of LIR/vLy(8)=4 in local main sequence galax- 
ies, and that this ratio holds for main sequence galaxies up to 
z~2.5. 

For the z~2 sample in Figure |6] we could not resolve the 
break point between sloped log(vLv(8)/LlR) and the likely 
constant value at lower Alog(SSFR)MS- This is due to our 
inability to include galaxies f^urther below the main sequence 
at this redshift. The lowest Alog(SSFR)MS point which is on 
and slightly below the main sequence is consistent with the 
log(vLv(8)/LlR)=-0.58 limit derived from z~l and may al- 
ready include galaxies past the break. 

Figure|6]implies that there is a single relation between SED 
shape (i.e. vLy(8)/LlR and the matching 8-60 fim template 
in general) and the SSFR offset from the main sequence that 
applies all the way from z~0 to z~2. This relation can be 
expressed in terms of vLy(8)/LlR: 



\og{vLy(8)/LIR) = 

-0.58rf.04 : Ams < -0.14 

-0.65±o.o3 - 0.50±o.o6 X Alog(SSFR)Ms : Ams > -0.14 

(3) 

where Ams = Alog(SSFR)MS is expressed in Gyr ' units and 
we adopt the mean from z~l and z~2 for the sloped part. 
The errors on the fitted slope and intercept coefficients are 
correlated with p - -0.75. For the specific case of the CEOl 
library, it can be approximated by luminosity-labels Aceoi of 
this hbrary; 




Log LIR(160) 

Fig. 7. — The scatter of LIR extrapolated for 1.5<z<2.5 sources from ob- 
served 24 yum using templates rescaled according to the sources offset from 
the main sequence, versus LIR measured from 160 /jm. Blue circles are SFG, 
red circles are sources with power-law SED in the IRAC bands and black 
crosses ai'e X-ray AGNs. The blue dashed and dotted lines indicate the over- 
all mean and standard deviation for the SFG. 



log(AcEOl) = 

10.38 : Ams < -0.14 (4) 

10.58 + 1.44 X Alog(SSFR)Ms : Ams > -0.14 

We find a consistent decrease of the mid-to-far infrared ra- 
tio vLv(8)/LlR in the SED for galaxies that are offset above 
the main sequence. The strong scaling of SFR on the main se- 
quence with redshift SFR/M, cc (1 +z)^-'^ dBouche etalj2010l) 
makes LIRG and ULlRG-like luminosities common for mas- 
sive star forming galaxies on the main sequence at z~2. Most 
of these highly star forming galaxies indeed have IR SED 
shapes that resemble lower redshift main sequence galaxies. 
For both z~l and z~2, a decrease in vLv(8)/LlR is observed 
with increasing SSFR above the main sequence, but at much 
higher absolute luminosity than locally. 

For the practical application of deriving LIR for a galaxy of 
known redshift, stellar mass, and rest frame 8 jum flux we can 
use Equation in (or Equation [3] in combination with template 
libraries other than CEOl), stellar mass and the main sequence 
at the given redshift to derive a CEOl style template library 
as a function of LIR, which is then specific to this redshift 
and stellar mass (see Appendix IdTi. This library can then be 
applied in the usual way to estimate LIR from the 24 yum flux 
and redshift. We have tested this method using it to estimate 
L1R(24) for the 1.5 < z < 2.5 sources from the MIPS 24 fim 
fluxes as described. The comparison with the LIR derived 
from the 160 fim photometry at the far-infrared peak is plotted 
in Figure|7] We can see from this figure that there is no bias in 
derived L1R(24) and no trends with LIR, albeit the <0.4 dex 
scatter is significant. We discuss this calibration for the u se of 
8 jum as a SFR indicator versus other calibrations in Sect. 19.31 

6. IRS SPECTRA OF HIGH-Z GALAXIES 

In the previous section we have derived a calibration for 
the SED templates as a function of the ASSFRms distance 
from the main sequence. The method in which we shift the 
filters across the spectral features by observing galaxies of 
slightly different redshifts (Figure |5) can only provide lim- 
ited information regarding the detailed spectra shapes, which 
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Fig. 8. — IRS spectra of z~2 galaxies and associted templates. Top panels: 
Template SEDs (black) scaled according to the SSFR distance from the main 
sequence, increasing from top to bottom, compared with stacked IRS spectra 
for non-AGN galaxies (blue). The three panels correspond to the selection 
bins as indicated by the number at the top-left of each panel. The templates 
are scaled according to the 160 pm flux and are not a fit to the spectra. Bot- 
tom: The selection bins for the stacked spectra. The bins are at a constant 
SSFR distance from the main sequence, which is marked as a thick solid Une. 



are consistent with the presence of PAHs. In this section 
we extend this analysis using a sample of 24 IRS spectra of 
z~2 log(LIR/LQ)*12 and 12 spectra of z~l log(LIR/ LH)^l 1 
source s in our field, which was first presented by Fad da et al.l 
(120101) . In total, 3 1 of the IRS sources are detected by PACS at 
160 /im and 6 of them have an associated X-ray detection and 
are flagged as AGNs. The IRS spectra allow us to test the SED 
templates, which were assumed to have PAH features similar 
to those observed in local galaxies, but were calibrated above 
using photometry alone. 

We first focus on the 16 non-AGN z~2 objects. Due to 
the weak signal in each individual spectrum we stack several 
spectra in bins according to the distance of the galaxies from 
the main sequence. The bottom panel in Figure |8] shows the 
location of the galaxies on the SSFR versus stellar mass di- 
agram and the three selection bins at constant distance from 
the main sequence. Before stacking, the spectrum of each 
object is converted to the vL^/LIR scale according to the indi- 
vidual LIR measured in the far-infrared by PACS. This avoids 
the few most luminous sources dominating the stacked result. 
The spectra are then averaged in vLy/LIR scale with equal 
weights. Equal weights prevent the few brightest and best S/N 
sources from dominating the final mean. We then compare 
the stacked spectra to the modified templates as a function of 
SSFR main sequence offset, as derived in Section|5] The tem- 
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Fig. 9. — IRS spectra of z~l galaxies and associted templates. Top panels: 
Template SEDs (black) scaled according to the SSFR distance from the main 
sequence, increasing from top to bottom, compared with stacked IRS spectra 
for non-AGN galaxies (blue). The two panels correspond to above and below 
the selection line indicated by the number at the top-left of each panel and 
illustrated in the bottom panel. The templates are scaled according to the 
160 fim flux and are not a fit to the spectra. Bottom: The selection bins for 
the stacked spectra. The sample is split at Alog(SSFR)ivis=0.3 indicated by a 
dashed black line. The main sequence is marked as a thick solid line. 



plates are not fitted to the IRS spectrum - they are scaled by 
PACS 160 yum flux and extrapolate down to the mid-infrared 
around rest 6 fim. The result is plotted in Figure |8] 

The IRS spectra are clearly PAH dominated. This 
strongly argues that PAH emission rather than an AGN 
continuum must drive the 'mid-infrared excess' and asso- 
ciated mid- infrared based SFR overestimates, as noted by 
Fadda et alJ (2010) fo r this sample and Rigby et al. (2008) and 
Murphv et aLj (l2009h for ot her IRS samples. It is also consis- 
tent with the conclusions of lElbaz et alj (1201 ll) based on pho- 
tometry. In addition, the decrease in vLy(8)/LIR when moving 
above the main sequence is indeed driven by a changing ratio 
of PAH to far-infrared, related to changing conditions in the 
interstellar medium. The stacked spectra of galaxies on the 
main sequence or slightly below it (region 1 , bottom panel) 
are described very well by the rescaled template for the main 
sequence. These galaxies with the strongest yLv(8)/LIR relate 
to the ceiling val ue of 1 A<m for the 6.2 jt/m PAH equivalent 
width reported in iFadda et"aD (120 1 Ol) . 

The templates were calibrated in Section |5] according to 
photometry which covered mainly the 7.7 fim PAH peak, but 
in the IRS spectrum the 6.2 and 11.3 fim peaks, as well as 
the 10 yum silicate absorption are also well matched. The z~2 
main sequence galaxies indeed resemble scaled-up local tem- 
plates even in their spectral details: PAH emissions, silicate 
absorption, continuum level. This is in broad agreement with 
previous results that the SEDs of high redshift galaxies resem- 
ble scaled up SEDs of locals galaxies of lower luminosities 
(see the introduction). We here argue that this resemblance 
is based on the relation of the galaxies to the main sequence. 
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rather than their absolute luminosities. 

As we progressively move away from the main sequence 
(region 2 and 3 in Figure |8]i the non-linearity in the mid- 
to far-infrared ratio is evident. Not only does the ratio of 
7.7 yum PAH to far-infrared decrease but the 6.2 and in par- 
ticular 11.3 jjm emissions are fainter relative to far-infrared 
as well. The typical width o f the main sequence at a given 
mass is 0.3-0.4 dex in SSFR dNoeske et al.ll2007HElbaz et alj 
l2007h and so bin 3 can be considered fully above the main 
sequence and show the strongest effect. While the templates 
are close to the spectra around 8 fim due to the way they were 
calibrated, they overestimate the emission from neutral PAHs 
at 11.3 fim as well as the continuum level at wavelengths of 
10 yum and above. These deviations compared to the stack of 4 
weak IRS spectra cannot be tested in the photometry because 
there are no filters that observe these rest-frame wavelengths 
at 1.5<z<2.5. 

A similar picture arises from the sample of z~l spectra. 
Nine galaxies with IR luminosity similar to local LlRGs and 
without an associated X-ray detection are detected by PACS 
160 yum and 8 of them are detected at 100 yum as well. We split 
them at Alog(SSFR)MS=0.3 into two bins: on and above the 
main sequence. The stacked spectra are plotted in Figure |9] 
The spectra are matched very well by the scaled templates 
and the relative difference in PAH strength between, on, and 
above main sequence is clearly visible. The suppression of 
PAH strength is for this sample clearly associated with off- 
set above the main sequence rather than higher LIR. Given 
the consistency of photometric and spectroscopic results, it is 
quantitatively expressed by Figure |6] and Equation |3] Unfor- 
tunately at this redshift, the 1 1.3 yum PAH emission is outside 
the observed spectral range. 

In a scenario where star forming galaxies above the main 
sequ ence are c haracterized by compact intense starbursts 
(e.g..lWuyts et al. 2011a; Elbaz et al. 2011), two factors can 
contribute to the weakening of all PAHs relative to the far- 
infrared for these galaxies. First, high radiation field intensi- 
ties in these compact regions can lead to inherently reduced 
PAH emission along with warmer large grain dust tempera- 
tures (e.g. Dale et al. 2001). Second, in the galaxies above the 
main sequence the PAH emission, even in the mid-infrared, 
could simply be obscured by higher column densities. Given 
the mid-infrared extinction curve with its strong silicate fea- 
ture around ~10 fj.m, such obscuration of a PAH dominated 
spectrum will cause reduced 8.6 and 11.3 yum PAHs in the 
wings of the silicate feature, compared to the 7.7yL/m one. 
This is indeed observed in some local galaxies ( Spoon et al. 
I2OO0I) . The S/N of the stack of four objects above the z~2 
main sequence is not sufficient for such a test. From high 
S/N IRS spectra of local UL IRGs (thus above the main se- 
quence) IVeilleux et alj ([2009") conclude that the obscuration 
of the PAH component is typically moderate: Ay <10. This 
analogy suggests intense radiation fields to be the dominant 
c ause of the PAH weaknes s, implying also warmer dust. 

iKelson & Holdenl j201Q') argued for a strong contribution of 
thermally pulsating asymptotic giant branch (TP-AGB) stars 
to the mid-infrared emission of galaxies from z=0 to z=2, that 
could in principle be another cause for overestimated mid- 
infrared based star formation rates. Mid-infrared spectral evi- 
dence for AGB stars in integrated galaxy light indeed exists 
via detection of silicate emission in passively evolving lo- 
cal galaxie s, with favorably lo w ratios of current to past star 
formation (Bressan e t al.ll2006) . The IRS spectra of our star 
forming galaxies instead are dominated by luminous PAHs 



from active star formation, arguing against a noticeable AGB 
contribution to their mid-infrared excess. 

We conclude that the 8 //m 'excess' which caused the over- 
estimation of SFR derived from MIPS 24 jum photometry for 
z>1.5 galaxies was caused almost entirely by underestimation 
of the PAH emission when using local templates. The calibra- 
tion derived in Section|5] that selects the templates by distance 
from the main sequence instead of LIR correctly predicts the 
8 yum emission, which is dominated by the PAH emission. The 
spectra of the SFG (no X-ray detection or IRAC-bands power- 
law) show no significant continuum contribution which could 
indicate an obscured AGN, though do not exclude the pres- 
ence of a low luminosity one. 

7. THE MID-TO-FAR IR SED AS A FUNCTION OF LUMINOSITY 

In this section we will derive a benchmark by which SED li- 
braries can be calibrated to match high redshift galaxies with- 
out required knowledge of the galaxy mass. We use the meth- 
ods described in section |4] but in this case using narrow LIR 
bins, for the same z~2 and z~l parent samples. 

The 1.5 < z < 2.5 sample is divided into 6 LIR bins in 
the range 1 1.3<log(LIR/L0)<12.8. The 0.7 < z < 1.3 sample 
is divided into similar LIR bins, however we are unable to 
populate the two highest luminosity bins over our fields due 
to the rarity of such objects at z~l. Lower luminosities of the 
z~l sample are limited by the depth of the 16 /im data and 
in order to maintain high 16 yum detection rate, we limit the 
lowest luminosity similar to that of the z~2 sample. 

The result is plotted in Figure [TO] top and bottom for red- 
shifts z~2 and z~l respectively. The LIR bin range is indi- 
cated in each panel. For comparison, we also plot in dashed 
gray line the default CEOl template for the median luminos- 
ity in each bin. Figure [TT] summarizes the mid-infrared fit 
results by plotting for each LIR bin the mean vLv(8)/LIR 
derived from the fitted template. For reference, we add the 
vLv(8)/LIR versus LIR for the local CEOl library templates. 
Again we specify for all redshifts vLv(8)/LIR in terms of the 
z=2.0 MIPS 24 fim flux for the template. 

There is a clear evolution with redshift of the relation be- 
tween rest-frame vLv(8 yum)/LIR and LIR. The local CEOl 
SEDs consistently underpredict the 8 /vm emission at all red- 
shift and luminosity bins. This is the cause of the overestima- 
tion of SFR derived from 24 //m at z>1.5 which was reported 
in previous studies, in comparison to local LIR-SED template 
relations. At z~l (Figure [TOl bottom), the 24 yum filter ob- 
serves rest-frame wavelengths of A > 10 fim where the origi- 
nal CEOl SEDs and the newly fitted ones agree much better 
This is because the region of the 11.3 jum PAH with its un- 
derlying continuum scales closer to linearly with LIR. Hence, 
the derived 24 fim based SFRs were accurate for z<1.5 galax- 
ies. The rest frame 8 fim emission is underpredicted by the 
templates at z~ 1 as well, but it is not covered by the MIPS24 
filter. A 16 jum filter is required to detect at z~l an effect 
similar to the z~2 'mid-infrared excess'. 

Interestingly, for our redshift bins the points in Figure [TT] 
are nearly aligned and we pass a linear fit through them. The 
slopes of the relations for z=2, 1 and (the CEOl original 
library) are nearly identical. The curves for each redshift and 
log(L/L0)~12 are: 

log(vLy{S)/LIR) = 

-0.35±o.o8 log(LIR/lO'^Le) - 0.86±o.o3 : z ~ 2 

-0.32^0.13 log(LIR/lO'^L^) - 0.96^.06 : z ~ I ^ ' 

-0.33log(LIR/lO^^Ls)-l.l3 : z ~ 
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Fig. 10. — Top: Rest-frame vLy/LIR for various LIR bins of 1.5 < z < 2.5 galaxies. Colors and symbols are identical to those used in Figure|5] The original 
CEOl template that is associated with the bin luminosity LIR is plotted as a dashed gray line. Bottom: Same as above for 0.7 < z < 1.3 galaxies. The additional 
IRS 16 fim photometry is colored in cyan. 



One can view this redshift evolution in two ways. The first 
is that the PAH strength vLy(8)/LlR increases with redshift 
for a given LIR, i.e. evolution upwards in Figure[TT] The sec- 
ond is that high redshift galaxies have a similar vLy(8)/LlR 
as lower luminosity local galaxies, i.e. a shift to the right in 
Figure [TT|by ~0.5 dex to z~l and ~0.8 dex to z~2. The re- 
sults of section |5] favor the second description. This in turn 
predicts that the relations described in Equation |5] will flatten 
into the constant main sequence value of log(vLv(8)/LIR)=- 
0.58 at lower luminosities, as also happens with local galax- 
ies. This description via a shift in the LIR associated with a 
given template to lower luminosities as the redshift increases 
is also in line with the results of previous work listed in the 



introduction. Thus, the overestimation of 24 fim SFRs was 
not due to globally enhanced PAH strength at high redshifts, 
but due to a mismatch of template shapes with the associated 
LlRs in the library. Same physical conditions in galaxies of 
a given LIR at all redshifts cannot be assumed. We will dis- 
cuss the relation between the above description (SEDs depend 
on LIR) and our preferred description from Section 2](SEDs 
depend on distance from the main sequence) in Section|9] 

Adapting the CEOl library for use with high redshift galax- 
ies is a simple matter of assigning new luminosities to each 
template. For example, from Equation |5] we get that a CEOl 
template, originally associate with with luminosity Ac£oi, 
should be rescaled to a new luminosity L,=2 with which it is 
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Fig. 1 1 .— Mean log(vL,,(8)/LIR) vs. log(LIR) for z~2 (blue) and z~ 1 (red) 
galaxies. The relation for local galaxies as represented by the CEOl templates 
is plotted in black. The gray dotted hne mai'k the main sequence yL,,(8)/LIR 
value as derived in Section|5] vL,,(8) value is the MIPS24 filter luminosity as 
would be measured for a galaxy at z=2. 



associated at z~2, using the relation: 

log(L,=2) = 0.943 log(Ac£oi ) + 1 -5 1 (6) 

We stress that the negative slopes in Equation|5]are valid only 
in the luminosity range constrained by our PACS data. Lo- 
cally, log(vLv(8 )/L/j?) levels off at ~ -0.58 dex for low lumi- 
nosity objects ( Chary & Elbazll2b01h as illustrated in FigfTTI 
Since low luminosities are not sufficiently constrained in our 
z>l data (16 jim limited) or z>2 data (160 um limited), it 
appears prudent to either not apply Equation |5] below the lu- 
minosities that are constrained at a given redshift or at least 
assume a similar leveling off at \og{vLy{^) j LIR) - -0.58, in- 
dicated by the fading dotted line in Fig[TT] We compare the 
template calibration derived in this section with other calibra- 
tions in Sec 19. 31 

The far-infrared range covered by PACS, i.e. rest-frame 
wavelengths of 20-90 fim in Figure [10] shows a behavior 
consistent with the scaling of the SEDs in the mid-infrared. 
The best fitting templates for the z~2 sample are in the range 
of 11.3 < log(Ac£oi) < 12.1. In practice, these CEOl 
templates have very similar vLv.(/l)/LlR across the rest-frame 
wavelengths probed by the PACS filters and the exact Aceoi 
is degenerate in this luminoity and wavelength range. At 
the highest luminosities (bottom panels), local galaxies would 
typically have a hotter {T^usi ~ 50 K) ULlRG-like rest frame 
far-infrared SED, plotted in dashed line. The observed SEDs 
resemble instead LlRG-hke (log(LlR/LQ)>l 1) SED shapes 
with their luminosity elevated by ~0.4-0.8 dex (see also 
iMuzzin et al. 2010; Rex et al. 2010). At the lower luminosi- 
ties, the 70 and 100 jt/m filters only detect the upper part of 
the population scatter However, when taking into account 
the stacks (in some cases containing much larger number of 
sources than the detections) the mean fits the local LIRG tem- 
plates. At the highest luminosities we detect nearly the full 
sample at all wavelengths and the match to the best fit tem- 
plate is quite striking - almost as if we are sampling the same 
galaxy at different 1.5<z<2.5 redshifts. 

8. AGNSVS. SFG 
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Fig. 12. — Top: Stacked IRS spectrum of four z~2 AGNs (blue). Spectrum 
of another AGN (U4950) is plotted in green. The predicted stacked spectrum 
from the rescaled CEOl templates is in solid black. The templates ai'c fitted 
to the PACS 160 fim flux, not to the spectrum. Bottom: The location of the 
AGN hosts with respect to the main sequence. U4950 is plotted as a green 
star. 



In all our samples thus far we have removed all sources 
detected in the Chandra 2 Ms surveys in GOODS-N and 
GOODS-S. The stacked IRS spectra of the SFGs (Section|6ll 
clearly show mid-infrared emission dominated by PAHs typ- 
ical of star formation and no significant emission from ob- 
scured AGNs. We now turn to inspect the mid-infrared emis- 
sion of the X-ray sources. At redshifts of 2 and with the depth 
of the 2 Ms catalogs, it is safe to assume that all the X-ray 
sources detected by PACS 160 //m are AGN hosts. Only a 
small fraction of the highest LIR sources are suspected to 
have their X-ray emission dominated by star formation (see 
Section |2l), which does not rule out the possible presence of 
an AGN. 

In Section|5]Figure|7]we compared the LIR as derived from 
24 yum to the LIR derived from 160 yum. In the figure we also 
plotted the X-ray AGNs of our sample (black x marks) and 
sources that show a clear power-law SED in the 3.6-8.0 jum 
Spitzer-IKAC bands (red circles). If AGN-heated dust con- 
tributed significantly to the mid-infrared emission, we should 
expect an enhanced L24/L160 for these galaxies, due the mid- 
infrared continuum emission from warm circumnuclear dust. 
This would lead to wrong extrapolation from mid- to the far- 
infrared and overprediction of the luminosity. In the figure, 
most AGNs are well within the scatter of the normal SFGs. A 
small bias does exist and AGN hosts show a small statistical 
enhancement in ~8 yum flux. In only 2 out of 18 AGN-hosting 
galaxies the mid-infrared emission is significantly enhanced 
compared to the typical scatter for the SFG in the sample. 
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The lFadda etaTI (120101) IRS sample includes 5 z~2. X-ray 
AGNs which are detected with PACS 160 fim. These were ex- 
cluded from the analysis in Section |6] The individual spectra 
are quite noisy. However, when normalizing by each galaxy's 
LIR, four of the spectra overlap within the noise, while one 
(U4950) shows significantly brighter mid-infrared emission. 
In Figure [12] we plot the stacke d spectrum of the four z~2 
X-ray detected sources from the ^ Fadda et al.l (1201 0) sample. 
We use the same stacking (spectral averaging) procedure as 
in Sec|6] The spectrum of U4950 is plotted separately. Over- 
plotted is the expected mean spectrum from the CEOl tem- 
plates, rescaled according to the results from Section |5] and 
stacked in a similar way as the spectra. The templates which 
were scaled according to the non-AGNs also match the X-ray 
sources quite well. In particular, there is no enhanced contin- 
uum level which may be attributed to AGN-related circumnu- 
clear dust emission. The stacked AGN mid-infrared spectrum 
shows all the typical features of a SFG and the r elative AGN 
contribution must be low, as also concluded by iFadda et aLl 
(l2OT0l) . 

The spectrum of U4950 is significantly different and hence 
was not included in the stack. In addition to having 
much higher rest mid-infrared fluxes than expected from a 
SFG, which as j udged from the LIR lies exactly on the 
iRodighiero et al.l (120101) main sequence (Figure [T2] bottom 
panel), it seems to be flatter and elevated by an added con- 
tinuum rather than by an enhanced PAH emission. The SFG 
template expected for U4950 according to its mass and LIR 
is plotted as a dotted line in Figure [T2l scaled up by a factor 
3.6 to match its observed 24 fim flux. The U4950 spectrum 
is fundamentally different from such a scaled SFG template 
and is dominated by AGN continuum which is evident short- 
ward of 6 Aim, wher e AGN continua can be cleanly identified 
(iLaurent et al.ll2000h . 

This behavior is consistent with the X-ray properties. We 
convert from L^(2-10 keV) to boloni e tric lu minosity Lagn 
us ing the relation of iMaiolino et aTl (l2007i) and a factor 
7 (iNetzer & Trakhtenbrod 120071) conversion from optical to 
bolometric luminosity: 



log(LAGN) = 



log(L,)- 11.78 
0.721 



+ log(7) 



(7) 



The IR, X-ray, and AGN bolometric luminosities are sum- 
marized in Table |4] U4950 has a significantly higher 
log(LAGN/LIR) than the next highest ratio by more than 
0.8 dex and more than 2 dex higher than the rest of the AGNs 
in this sample . 

Bau er et al.l (HoiO) derived the following relation between 
mid-infrared and X-ray AGN luminosity: 



log 



(,ergs-i 



= 1.21 +0.06 -log 



i2-10keV 

erg s"' 



-8.7 + 2.6 (8) 



This relation has been derived for much more luminous AGNs 
than the o nes in our sample, but it is also consistent with the 
results of |Lutz_eLal .^ (2004) which were derived for low lu- 
minosity AGNs. Combining this relation with equation |5] for 
z~2 and assuming that the AGN continuum is roughly flat in 
vFy between 5.8 and 8 jum, we get the condition that in order 
for a z~2 AGN to dominate the emission close to rest frame 
8 /im, the relation between the X-ray luminosity and the LIR 
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IRS SOURCES FROM THe IFaDDA ET AL.I 420101) SAMPLE WITH AN X-RAY DETECTION. 



ID 


log L2-10kcV^ 


log LAG>i 


log LIR^ 


log vLv(MIPS24)^ 




[erg s"'] 


[erg s"'] 


[Lo] 


[Lo] 


U428 


<42.43 


<43.36 


12.07 


11.40 


U4642 


42.54 


43.51 


11.86 


11.42 


U4950 


44.28 


45.91 


11.98 


11.93 


U5775 


<42.54 


<43.51 


12.16 


11.28 


U5877 


43.96 


45.47 


12.37 


11.47 



'■Faddaetal. (2010) 
''Equation|7] 
'^This work 



must satisfy: 



log 



lOkeV 



ergs- 



0.5371og(^) + 19.71 log(^)>11.3 
0.8261og(^) + 6.71 log(^)<11.3 

(9) 



Or in terms of bolometric AGN luminosity (equation |2): 



log 



^agn \ ^ I 0.745 log (^) + 11.84 log(^) > 11.3 
ergs-ij~| 1.1451og(^)-6.18 log(^)<11.3 

(10) 

The condition for log(LIR/LQ)<l 1.3 assumes that 
vLv(8)/LIR=-0.58 below this luminosity, as suggested 
in Section |7] A similar, albeit more complicated relation 
that involves Alog(SSFR)MS can be derived by combining 
Equation[8]with Equation[3] 

Overall, for our sample where log(LIR/L0)~12, an AGN 
bolometric luminosity of log(LAGN/erg s"')>45.8 or X-ray lu- 
minosity log(L2-i()kev/erg s"')>44.2 is required to dominate 
the 8 yum emission. Only U4950 reaches these AGN lumi- 
nosities in the IRS sample used here and indeed shows a 
clear excess in its 24 /vm flux. In general, such high AGN 
luminosities are rare in the GOODS fields which explains 
why most of our (far- infrared bright) AGNs are indistinguish- 
able from the SFGs in their mid-to-fa r IR emission (see also 
'Nordon et al.l l2010HElbaz et al.ll2010 '). The changed slope at 
log(LIR/LQ)<l 1.3 corresponds to a population that is domi- 
nated by galaxies on or below the main sequence. Here, the 
8 jum luminosity associated with the star formation decreases 
proportionally to LIR (constant vLv(8)/LIR), while above that 
threshold the 8jum luminosity will vary more slowly with LIR 
due to the changing mid- to far-infrared SEDs. The ratio of 
AGN luminosity and LIR at which the AGN dominates the 8 
^va e mission hence c learly depends on sample selection. 

iFadda et al.l (120101) fitted and subtracted the PAH emission 
from the stacked spectra of all the IRS sources for which the 
mid-infrared spectra do not appear to be dominated by an 
AGN. They were thus able to constrain the mean AGN con- 
tribution to the 6 yum continuum and derived a mean intrinsic 
AGN L2-i()kev = 1 X 10"*^ erg s"'. According to the above 
analysis, such luminosities are nearly two orders of magnitude 
lower than the AGN luminosities required in order to create a 
true broadband 24 yum 'excess' in massive (~ 10" M©), z~2, 
main sequence galaxies. Given that on average AGN hosts 
seem to have similar SFR as non-active massive galaxies at 
same redshift (Sh ao et al. 201 0), this will likely still be true 
for many less far-infrared luminous objects, but a higher frac- 
tion of quenched sources with 24 /vm emission dominated by 
AGNs may be expected. 
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9. DISCUSSION 

9.1. The infrared SED reflects ISM conditions rather than 
total LIR 

Our findings from the previous sections can be summarized 
as follows: The SED shape and in particular vLv(8)/LlR cor- 
relate best with the offset from a sloped reference line in the 
SSFR(M,) diagram. Over our limited mass range, the depen- 
dence on the exact main sequence slope is weak - it is the 
normalization that is more important. The redshift evolution 
of SEDs in this picture is driven by the shift of the main se- 
quence to higher SSFR at higher redshift. The SEDs also cor- 
relate (slightly worse and with explicit redshift dependence) 
with the absolute LIR, and the redshift evolution is manifested 
here as a shift in the LIR associated with each template shape. 

If infrared SED shape is tied to a typical local radiation 
field intensity (e.g. lDale et al.ll200TI) . which in turn is related 
to the efficiency SFR/Moas (SFE) by which molecular gas is 
converted to stars (the number of young stars per gas mass 
available to irradiate it), then it will also be linked to the 
SSFR, since SFE = SSFR ■ MJMc-is- Because both gas frac- 
tion McasAMcas+M*) and main sequence SSFR rise towards 
higher redshift, the increase in gas fraction and the increase 
in SSFR will partly compensate and SFE will be more closely 
linked to the SSFR offset from the main sequence. SFR/Mcas 
is lower (by factors 4-10) in z~2 high redshift main sequence 
galaxies, compared to local U LIRGs with simila r SFR but 
lower total gas fractions (e.g., iGenzel et al.ll201(j|) . At z~0, 
radiation field intensity, SFR/Mcas and SSFR offset are em- 
pirically correlated with LIR, largely driven by the peculiar 
properties of merger driven (U)LIRGs and closely linked to 
their compactness, as al so reflected in the local compact- 
ness/temperature relation (iChanial et al.l2007b . The larger gas 
fractions at higher redshifts permit larger LIR before invok- 
ing special events like mergers and will shift correlations with 
LIR. The preferred connection between SED shape and SSFR 
offset from the main sequence reflects that the infrared SED 
shape is physically linked to the local ISM conditions, and 
only indirectly to LIR. In line with these arguments, sizes of 
equivalently luminous IR galax ies change to higher reds hift, 
with implications on their SED (Ruiopa kam et al.ll201 la 'lb^. 

Far infrared fine-structure emission lines and in particu- 
lar [CII] exhibit a similar behavior to the PAHs. Compact 
luminous objects with intense radiation fields such as local 
ULIRGs show a [ CII] deficit relative to the far-infrared con- 
tmuum emission jMalhotra et all 1 19971 l200Tt IContursi etHl 
|2P02; Luhman et al.^2003) wit h argu ments for a direct phys- 
ical link to PAH jHelou et al J 120011) . The [CII] line with a 
wavelength of 158 fim is much less likely to be attenuated 
than the mid-infrared PAHs and the deficit is intimately re- 
lated to the radiation fields around the star forming regions 
and the resulting structure of the HII and photo-dissociation 
regions (PDRs). While there are no [CII] observations for 
our specific targets, the qualitative findings for [CII] both lo- 
cally and at high redshift are consistent with the behavior of 
vLv(8)/LIR. There is a high-z [CII] deficit - compared to lo- 
cal it is setting in at higher LIR, but at similar ratio of LIR 
and gas mass (SFE), equivalent to similar main sequence off- 



set (IGracia-Carpio et al. 2011). This is directly analogous to 
our finding of a relation vLv(8)/LIR to LIR that evolves with 
redshift, but a single relation with main sequence offset. The 
connection between SFE and offset from the main sequence 
which was described above, together with the analogy be- 
tween the [CII] deficit and decreasing vLv(8)/LIR in galaxies 



above the main sequence, suggest that the two observed phe- 
nomena are likely related and intense radiation fields to be the 
dominant cause of the PAH weakness. 

To zeroth order, the redshift dependent main sequence 
can be seen as a reference and with the distance from 
this reference a number of observables change together: 
The SSFR (by definition), t he star formation efficiency 
SFR/Mcas (Gen zel et al. 120101). mo rphologv and compact- 



ness (Wuvts et all 120 11 at 



Elbaz et al. 2011) the mid- to far- 



infrared SED (this work, lElbaz et al.ii201 li) . the far-infrared 
SED shape tracing the large grain temperature (Elbaz et al] 
120111). and far- infrared fin e-structure emission lines deficits 
( iGracia-Carpio et ani201 ll) . Compactness due to mergers is 
likely a key factor in changing the local conditions in the ISM 
that drive the scaling relations for these observables. The link 
of low vLv(8)/LIR mid- to far-infrared SED shape with spa- 
tial compactness, previously indicated from local universe ev- 
idence mentioned in the introduction, is strongl y supported 
both a t low and at high redshift in the analysis of lElbaz et al.l 
(|20T1 . 

9.2. A simple model connecting cloud conditions and global 

SED 

Galaxy-integrated LIR is not a fundamental parameter that 
determines the conditions in the molecular clouds and PDRs 
which produce the IR emission. Rather, it has an empiri- 
cal correlation with these conditions. Borrowing the terms 
from thermodynamics, a clear distinction should be kept be- 
tween total extensive quantities which are summed over the 
entire galaxy (LIR, stellar and gas masses . . . ), and intensive 
quantities which are averaged, such as SSFR, SED shape, gas 

depletion time scale It is easy to imagine a galaxy in 

which everything is doubled (every cloud and star becomes 
two etc.): extensive quantities like total LIR will be doubled, 
but intensive ones such as SED shape and SSFR will remain 
unchanged. The relations between extensive and intensive 
quantities will change in such a scaling. The SED shape as an 
intensive quantity that is related to local cloud physics should 
more directly scale with another intensive quantity such as 
the SSFR or its offset from the main sequence. The specific 
star formation rate at a given redshift, which is proportional 
to LIR/M», offers a natural scaling reference: LIR and M» are 
measurable and their ratio does not change when scaling as 
described. 

We adopt the simplified hypothesis that main sequence 
galaxies at all of our redshifts form their stars in a single 
type of star forming molecular clouds (which for brevity, we 
will simply refer to as clouds) and discuss the implications 
for our findings as well as other scaling relations below in a 
toy model. Main sequence galaxies with different numbers 
of these clouds will thus have similar mid-to-far IR SED but 
a different LIR. Galaxies above the main sequence will have 
different local cloud properties. 

If the number of the clouds were proportional to the stel- 
lar mass, more stars would mean proportion ally more of the 
same c louds and LIR emission. Our adopted Rodighiero et al.l 
(l2OT0l) main sequence argues instead that the number of 
clouds scales less than proportional with the stellar mass when 
moving al ong t he main sequence. This agrees with the finding 
in Section [23] that galaxies with constant vLv(8)/LIR are not 
at constant SSFR, but have a SSFR(M») dependency. 

Comparing main sequence galaxies of similar mass at dif- 
ferent redshifts in this picture, we expect more of the same 
clouds per stellar mass at higher redshift. If similar stellar 
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masses imply roughly similar galactic radii, then the high- 
z galaxies have similar molecular clouds with less empty 
volume between them, meaning a higher surface brightness. 
Higher surfa ce brightness of main sequence galaxies is indeed 
observed by lWuyts et alJ (l201 l a). 

We explore the scenario of a single type of star forming 
cloud in a toy model. One can define a dimensionless effi- 
ciency parameter for star formation as: 



PGas 



(11) 



where p* is the SFR per volume element, pcas is the gas 
density and tjj is the gas free fall time scale (t^j oc ''^) 
of a gravitationally bound cloud, e is determined by the 
micro-physics inside the molecular cloud and we will as- 
sume it to be a global constant. This translates t o the local 
Kennicutt-Sc hmidt (KS) relation ( Schmidt 195^ ; iKennicutj 
[T989llT998bl) : 

p. oc ep" (12) 

where we assume p* and p^ to be averages over the star form- 
ing cloud (as opposed to the entire galaxy volume), indicated 
by the subscript c. Traditionally a = 3/2, however due to vari- 
ous measurerrients deriving values in the range of 1 <a < 1-7 
dBouche et al."'2007'; ' Kennicutt et 301200 7': 'Bigiel et al.J|2008t 
iGenzel et al. 2010) we leave this as a general parameter. The 
free-fall timescale is only relevant for a gravitationally bound 
cloud set to collapse and form stars, not for the averaged den- 
sity over the entire galaxy. Hence, the above is a local relation 
applicable to a small-scale molecular cloud entity. We would 
like to integrate over the galaxy volume to derive the total, ex- 
tensive measurables: SFR and mass. To do this, we define a 
filling factor / that describes the fraction of the total volume 
V of the galaxy which is occupied by molecular star forming 
clouds. The integrated SFR over the volume V is then: 



(13) 



We can now divide by the stellar mass M» and use PcfV 
(MG2s/Mt)Mt to eliminate the volume and get: 



(Ma 



(14) 



where Mcas is the galaxy integrated molecular gas mass. 

The right-most term in parenthesis, that includes the aver- 
age cloud density and the efficiency parameter, describes the 
local conditions in the star forming regions and hence is as- 
sociated with the IR SED shape. All main sequence galaxies 
will have the same value in these parentheses, and also the 
same SED. Galaxies above the main sequence will boost the 
M,/M, by changed cloud conditions, going along with SED 
and other changes. 

From Equation fl4l galaxies with the same IR SEDs will 
tend to have similar SSFR, modified by their gas fractions. 
On a SSFR versus M, diagram, such constant-SED galax- 
ies (indicated by constant vLv(8)/LlR in this study) will 
lie on a slope which follows the slope of the Mcas/M, re- 
lation . Comparing different redshifts, changing gas frac- 
tions (iTacconi et al.l l2010) would mediate in Equation fT4l the 
change of main sequence SSFR with redshift. This change 
is needed to have same cloud conditions and SED at differ- 
ent SSFR, but at the same SSFR offset from the redshift- 
dependent main sequence Alog(SSFR)Ms- 



In local galaxies, molecular gas fractions decrease gently 
with st ellar mass, though with a large scatter (Sainton ge et al.l 
1201 lah . The slope of the local main sequence (in SSF R ver- 
sus Mt) is very similar (^rinchmann et al. 2004; Sali m et aP 
'2007; Peng et al.'2010'), in full agreement with the toy model. 
At higher redshifts, the dependency of M pas/Mt on M, is yet 
somewhat unclear. iTacconi et al.l (120101) measured the molec- 
ular gas fraction in a sample of z~l and z~2 galaxies lying 
close to their respective main sequence. Their sample size 
and spread in M, is not yet sufficient to conclusively probe for 
trends of the gas mass fraction^ith stellar mas s. Sample size 
also limits the conclusions of jPaddi et all f201()bl) who find a 
near constant Moas/M* for six z~1.5 main sequence galaxies. 

Another way to look at equation [14] is to divide both sides 
by the gas to stars mass ratio to get a dependency on the gas 
depletion timescale Tdep = Mc^^/Mt instead of SSFR: 



'dep 



epc 



(15) 



Tdep is an intensive quantity that is directly related to the 
process of star formation, unlike SSFR whi ch involves the 
mass of older stars. For local galaxies, Saintong e et al.l 
(201 lb) find a relatively tight correlation between Tdep and the 
SSFR across two orders of magnitude in SSFR: logijdei^ - 
-0.724 log(SSFR) + 1.54. Interestingly, when they scale the 
SSFR by the change of the main sequence with redshift, the 
relation agrees with z~l and z~2 SFGs, though the statis- 
tics are low. In Equation [15] the measureable quantity Tdep 
is equivallent to local cloud conditions and hence closely and 
directly related to the resulting IR SED. If indeed Tdep corre- 
sponds to the resulting IR SED shape, then this supports our 
conclusion from SectionQthat the relation between SSFR and 
SED shape scales with redshift like the main sequence. At a 
given redshift, the decrease of Tdep with increasing SSFR rep- 
resents the change in molecular cloud properties as galaxies 
move away from the main sequence towards 'bursty' compact 
sources. 

The above suggests that the main sequence up to z~2 is 
composed of galaxies with rather uniform molecular cloud 
properties. The difference in integrated quantities such as 
LIR is mostly due to the total number of clouds. For nor- 
mal galaxies, dust optical depths are low at the mid- to far- 
infrared wavelengths studied here. Observables will thus be 
less sensitive to second order effects of arrangement of these 
clouds than shorter wavelengths. As we gradually move away 
from the main sequence to higher SSFR the properties of the 
clouds gradually change and (epc""') increases, meaning Tdep 
decreases and the vLy(8)/LIR decreases, representing denser 
star forming regions. In this toy model we cannot differentiate 
between an increase of the average gas density of the clouds 
Pc or the efficiency e. 

The global KS law Es fR oc uses the integrated molec- 
ular gas-mass surface density (total Mcas over area) and in- 
tegrated SFR surface density. When considering only main 
sequence galaxies or galaxies parallel to the main sequence 
in general, the power expected to be retrieved is a 1 in- 
stead of the theoretical a - 1 .5 or in general the a of the lo- 
cal KS law (Equation [T2I). which could partially explain low 
a values quoted in recent works that distinct between types 
of galaxies and atomic versus molecular gas masses (e.g. , 
iGao & Sol omonll2004t iBigiel et aUlIOOSt iLerov et al.ll200l 
Genzel et a i MioT This is because according to the above 
suggested model and along lines of constant Alog(SSFR)MS, 
the increase in the surface densities is due to filUng of the 
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empty volumes by more of the same molecular clouds, in- 
creasing mean surface densities for SFR and for (molecular) 
gas proportionally, a = 1 .5 may still apply to the individual, 
or even fragments of molecular clouds (local law), if it were 
possible to measure it on small enough scales. The various 
measured values for the power in the KS law highly depend 
on the definitions, methods and selections used. For a discus- 
sion on the various measurements of the KS slope see e.g., 
iGenzel et all dMol) . 

How does equation [14] relate to the correlation of SED 
shapes with LIR? This is the original way in which the CEOl 
library has been constructed for local galaxies and the correla- 
tion is also seen in high-z galaxies (Section]?] Figure fTTTi. As 
an extensive quantity, LIR must be correlated with the clouds 
that emit the SED through other parameters or circumstance. 
For example, locally, luminosities of L1R> 10'^ require 
either a very massive main sequence galaxy which has not 
turned passive (rare according to the mass function), or in fact 
a lower mass but with a boosted SSFR due to some special 
event (like mergers). In such a case, LIR becomes correlated 
with the cloud properties because LIR selected sample get in- 
creasingly dominated by galaxies above the main sequence at 
high LIR. The global shift to higher LIR in the relation be- 
tween SED shape and LIR for z=l-2 galaxies (Figure [TTIl is 
in the context of the toy model's Equation [14] attributable to 
the increase of gas fraction with redshift for galaxies both on 
and above the main sequence. 

How do the two correlations SED-LIR and SED- 
ASSFRms fit together? A sloped main sequence produces this 
naturally: Over a limited mass range, a selection by LIR (-1 
slope in SSFR(M,)) will then produce a similar selection to 
one by ASSFRm^ from a sloped main sequence. As stated be- 
fore, in the SSFR versus mass diagram we associate the slope 
of constant vLv(8)/LlR galaxies with the slope of the main se- 
quence. Both slopes are not well determined and it is possible 
that this association, within the large uncertainties, is inciden- 
tal. In practice, we measured ASSFRms from a reference line 
that we fixed. The more important aspect is that the reference 
line we chose scales with redshift like the main sequence. 

The toy model, capturing cloud properties into pc which is 
assumed to be identical for all main sequence galaxies, is ad- 
mittedly simplifying. Nevertheless, it is able to capture the 
essence of the link between SEDs and SSFR offset from the 
main sequence, and to motivate scalings with redshift that are 
related to gas content. Further insight will come from testing 
its assumptions, since pc may not uniquely define the kind of 
clouds in the galaxy and not have a 1:1 link to SED. Differ- 
ent cloud structures could produce a similar pc, but different 
SED. Metallicity and non-linear effects can also play a role, in 
particular the latter when the local filling factor of the molec- 
ular gas approaches / ~ 1 and the clouds start affecting each 
other While being a sensitive tracer to the conditions in the 
star forming regions, yLv(8)/LlR can be affected by many dif- 
ferent physical conditions of PAH and continuum emission, 
grain size distribution and the detailed chemistry. Attenuation 
in the mid-infrared in extreme environments, such as those 
encountered in local ULlRGs can play a role in these special 
cases. Therefore some scatter is to be expected in the cor- 
relation between vLv(8)/LlR and the distance from the main 
sequence, LIR, gas depletion time, and other quantities. 

9.3. Comparison of 24 jim-based SFR estimates adopting 
different mid- to far-infrared SED calibrations 



In Sections ]5] and ]?] we derived two calibrations for the 
24 jum photometry as an SFR indicator at redshift ~2. Both 
produced similar ~0.4 dex scatter in L1R(24)/L1R around z~2 
when fitting our data with the modified templates. The cali- 
bration by SSFR is applicable over a wide redshift range with 
implicit redshift dependency, whereas the one by LIR is red- 
shift specific. While we have presented evidence for a smooth 
variation of vLv(8)/LlR with SSFR offset above the main se- 
quence which we consider to be the correct physical picture, 
one can ask about the practical viability of other approaches. 

In the following, we discuss systematic deviations as well 
as the scatter of derived L1R(24)/L1R for our z~2 data, ap- 
plying our relations as well as ones from the literature. We 
assume only 24 fj.m flux, redshift, and stellar mass are known. 
The latter is used only for methods invoking the SSFR. In 
the summarizing Figure [13] we look at the ratio of total in- 
frared luminosity as derived from MIPS 24 fluxes L1R(24) to 
the PACs-based LIR, for the 1.5< z<2.5 SFGs in our sa mple. 
We compar e flie method s of CE O 1 , iMurphv et al] (1201 1[ Mil 
hereafter), iWuvts et al.l (l2008l W08 hereafter), lElbaz et al.l 
(1201 IL Ell hereafter), and our calibrations by LIR (Equa- 
tion ]5]l and by Alog(SSFR)MS (Equation [3]l. Note that the 
X-axis in this figure is the LIR as derived by PACS (la- 
beled LIR(160) for clarity) and not LIR(24). Plotting versus 
LIR(24) would tend to produce positive slopes even for a per- 
fect calibration. This is because in the presence of noise and 
a steeply declining luminosity function, the highest LIR(24) 
will tend to be dominated by up-scatter of lower luminosities. 
In each panel we show the results for the individual sources in 
our sample, the mean offset of LIR(24)/LIR and the standard- 
deviation around it, and finally the slope of a linear fit that is 
also overplotted, highlighting any systematic biases. 

The intrinsic scatter in vLv(8)/LIR within the galaxy popu- 
lations seems to be significant even after binning by ASSFRms 
(Figure ]5]l, as well as by LIR (Figure [TOl l. see also Figure ]3] 
We estimate this scatter to be abou t ~0.2 dex, Icr, consistent 
with findings by ( lElbaz et al.ll201 Ih . For the purpose of using 
rest frame 8 fim for the estimation of LIR, this intrinsic scatter 
sets basic limits to the achievable accuracy of any of the con- 
versions. An indirect effect of this scatter arises in calibrations 
for which vLv(8)/LIR varies steadily with LIR or with main 
sequence offset, similar to our calibrations and CEOl (where 
LIRcBoi vLyiSy-^). The scatter in the derived LIR(24) will 
then be nonlinearly amplified. For example, LIR(24) for a 
source with a higher than average vLy(8)/LIR (for its LIR) 
will not only be overestimated proportional to vLy(8), but in 
addition because a template with lower vLv(8)/LIR will be 
applied. Finally, the scatter will also blur any systematic defi- 
ciencies in the conversion methods themselves. 

The CEOl calibration (Figure [13] top left) produces a clear 
systematic offset for the reasons discussed throughout this pa- 
per There is no clear trend with luminosity, and the ~0.36 
dex scatter is relatively high because of the nonlinear 'ampli- 
fication\ 

Mu rphy et al.l (l2()l Ih . on the basis of mostly the data of 
Murp hv et al.l i 2009ir suggested an empirical correction to the 
LIR as estimated when using the original CEOl template li- 
brary. This correction assumes that any LIR< 10'^ Lq as ob- 
tained when using the CEOl templates are accurate, but ap- 
plies a luminosity dependent correction when the fit results in 
LIR> 10'^ Lq. This method does not change the templates 
themselves, but instead is applied to the LIR, after it is de- 
rived using the original CEOl library. This Mil correction 
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by a power 0.6 cancels out most of the non-linearity built into 
the CEOl library, hence also reduces the associated amplifi- 
cation of scatter. The calibration of Ml 1 effectively removes 
the excess seen in the CEOl L1R(24) for the L1R~ 10'^^ Lq 
where it was constrained, but the luminosity of galaxies with 
L1R< 10'^ Lq still tends to be overestimated. It thus intro- 
duces a trend with the real LIR (Figure [13] top right). Note 
that the lower luminosity galaxies used by Mil to constrain 
their correction are at lower redshifts, plausibly explaining 
why systematic offsets remain for low L1R< 10 Lq galaxies 
at z~2. 

Wu yts et alJ (1201 lb^ suggested a unique luminosity- 
independent conversion fro m 24 tim to L IR. The conversion 
is based on a template from IWuvts et al.l dlOOS). This tem- 
plate has log(vLv(8)/LIR)=-0.92 with the M1PS24 filter at 
z-2. This is also the median value of the GOODS-S sample 
used in this work. This correction produces for our sample a 
very good overall correction (small mean L1R(24) excess), but 
with a clear overall trend with LIR. The luminosities derived 
by this conversion method are overestimated at low LIR and 
underestimated at high LIR. (Figure [T3]middle left). The scat- 
ter is low because intrinsic scatter in vLv(8)/LlR is not ampli- 
fied by the method - errors and real variation are propagated 
linearly. Given our results, both such a residual trend and a 
moderate scatter will be a feature of correction recipes that 
are adopting only a single template. In principle, such a sin- 
gle vLv(8)/LlR conversion is in contradiction to the clear LIR 
or SSFR dependence that we find in this study. In practice, 
the accuracy of such a conversion depends on the selection 
of the sample to which it is applied, and will develop biases 
when selecting mainly low luminosity (main sequence) galax- 
ies or very luminous galaxies (high above the main sequence). 
Since it does neither encode main sequence off'set nor a red- 
shift dependence, it can also create redshift-dependent biases. 
Main sequence normalization and flux limit of many surveys 
scale with redshift in a roughly similar way, however This 
will reduce such effects in practice. 

Figure|6]suggests that a replacement of the continuous SED 
calibration with SSFR by two discrete values for sources on 
and sources above the main sequence (the approach taken by 
lElbaz etal.ll2011 ) can potentially perform well as a two step 
approximation to the trend. The change in vLv(8)/LlR be- 
tween main sequence and sources with ~0.5 dex higher SSFR 
is similar to our ~0.4 dex scatter around the calibration. Such 
a treatment may be in particular applicable if comparing sam- 
ples on the main sequence with merger dominated samples 
and would directly correspond to the two modes of star for- 
mation seen in CO based studies of the Kennicutt-Schmitt law 
(iGenzel et al.l2010HDaddi et alJ2010al) . While we do not find 
such a corresponding bi-modality in the vLv(8)/LlR relation, 
it is beyond the scope of this work to study to which extent the 
smooth trend in Figure |6]is due to smooth changes of condi- 
tions within individual galaxies versus a varying mix of 'nor- 
mal' galaxies and 'mergers' in a given SSFR bin. 

The middle right panel of Figure[T3]shows the result of ap- 
plying the Ell method to our sample. We have adopted here 
the main sequence parametrization of El 1 (mass independent 
in SSFR) and their approach of separating main sequence and 
'starbursts' at SSFR a factor 2 above the main sequence. The 
two templates in Ell were in practice derived by separat- 
ing the galaxies in values equivalent to vLv(8)/LlR and not 
strictly in SSFR. Without prior knowledge on the SFR in the 
galaxy, the selection of the correct template can sometimes be 
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Fig. 13. — Comparison between various template sets and conversion meth- 
ods to derive LIR from a single 24 jjm flux measurement. The methods 
correspond to Charv & Elbaz (2001, CEOl), iMurphv et al.. (2011, Mil), 
Wuyts etal. (2008, W08), Elbaz et al. (201 it Ell) and the bottom panels 
correspond to the calibrations from Section^ (left) and Section [5] (right). 
The markers and colors used in Ell correspond to the assumed template: 
blue circles for main sequence, red squares for starburst, green triangles for a 
combination of both. The green hne in each panel is a simple linear trend. 



ambiguous, with permitted solutions for both templates at a 
given 24 fim flux and redshift. The L1R(24) difference then is 
~0.25 dex between the two. These cases are marked as green 
triangles in Figure [13] and we adopted the mean LIR from 
the two templates. The application of the El 1 method to our 
sample removes the bias and shows low scatter, but leaves a 
residual slope. In these properties, practical results from this 
two template SSFR-based approach resemble the single W08 
template method. 

The template calibrations derived in this work produce neg- 
ligible trends and biases with LIR and Alog(SSFR)MS, but a 
scatter which is 1.2-1.5 times larger than the other methods 
(Figure[T3]bottom). Much of the increase in scatter is related 
to the non-linearity in the conversion from 8 yum flux to LIR. 
This non-linearity in CEOl, that also prevails in our calibra- 
tions and is represented by the strong negative slopes seen 
in Figure [TT] means that a scatter in vLv(8) is super-linearly 
enhanced by a power of -1.5 when converting to LIR. With 
respect to a single template, the improvement by a more de- 
tailed vLv(8)/LIR versus LIR relation removes bias, but is 
too small to compensate for scatter in vLv(8) and its non- 
linear increase in the conversion to LIR. When using the tem- 
plate calibrations by Alog(SSFR)MS (Figure [6] Equation O 
the non-linearity is even higher, with a power of ~2. De- 
viations in vLy(8) can be due to the photometric errors and 
due to the intrinsic scatter in the population. In our sam- 
ple both have a significant contribution to the overall scat- 
ter When requiring knowledge of the masses (as when using 
Alog(SSFR)Ms), ~0.2 dex errors in the estimated masses are 
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an additional source of variation. Our Alog(SSFR)MS calibra- 
tion introduces a higher degree of non-linearity, that increases 
the other sources of variation by larger fact ors th an for other 
methods. This makes the findings from Sect. l3.1l of best corre- 
lations with Alog(SSFR)Ms, consistent with the larger scatter 
in LIR(24)/LIR reported here. 

Perhaps surprisingly, for 24 /im-based LIR derivations a 
simplified linear conversion will often produce less scat- 
ter in the resulting luminosities than when using LIR or 
Alog(SSFR)MS dependent templates, even though the latter 
methods better describe the true relation and remove bias. The 
difference between the methods and in particular between lin- 
ear and non linear methods will vary depending on the sam- 
ple selection and the various sources of noise (photometry, 
redshifts, masses). Any attempt of correcting for the sloped 
trends seen in Figure [13] will inevitably include a non-linear 
scaUng that will trade reduced bias for increased scatter 

10. CONCLUSIONS 

We have obtained deep 70, 100 and 160 yum Herschel-PACS 
photometric maps of the GOODS fields as part of the PEP 
project. Using this data we study the relation between 8 //m 
rest-frame emission and the total IR luminosity vLv(8)/LIR of 
z~l and z~2 galaxies. The deep far-infrared observations al- 
low us for the first time to reliably measure the total infrared 
luminosities of normal star forming galaxies on the z~2 main 
sequence with little need of extrapolation or stacking. We 
studied the typical SED shapes by binning the galaxies ac- 
cording to their ASSFRm^ distance from the main sequence, 
as well as by LIR, We found the SED shape that best de- 
scribes the population using a photometric mean SED fitting 
method (Section |4]i. For a subsample of 16 sources at z~2 
and 9 at z~l we tested the results from the photometric fit 
against stacked mid-infrared spectra and found a good match 
between the corrected templates and the spectra in the range 
6-12 fim. Finally, we examined a sample of X-ray AGNs 
which we compared to the SFG population. 

Our main findings are summarized below: 

• The excess in 8 ijm emission with respect to LIR for 
galaxies at z>1.5, reported in earlier studies, can be at- 
tributed almost entirely to enhanced emission of PAHs 
with respect to the locally-calibrated templates that 
were used. This is verified using a combination of 
LIR fromi/ersc/ie/-PACS and stacked deep Spitzer-IRS 
spectroscopy. No continuum component from an ob- 
scured AGN is typically required to explain the mid- 
infrared emission of far-infrared bright, star forming 
galaxies with log(LIR/L0)>l 1.5. A similar effect can 
be observed with a 16 fim filter at z~l (8 fim rest- 
frame). 

• vLv(8)/LIR correlates with both the absolute LIR and 
the ASSFRms distance from the main sequence of star 



forming galaxies. The latter gives a tighter corre- 
lation without ex plicit redshift dependence (see also 
lElbaz et ani201 1*). We derive calibrations for both rela- 
tions which we implement on the CEOl template library 
as an example. 24 fim can then be used to estimate LIR 
in 0.7<z<2.5 galaxies using the rescaled templates. The 
scatter is about a factor of 2.5 in our sample, but with 
no bias or trends with luminosity. Calibrating the tem- 
plates by LIR, z~l and z~2 galaxies have SEDs similar 
to local galaxies with log(LIR) lower by 0.5 dex and 
0.8 dex respectively. 

• The correlation of the SED shape with the distance 
from the redshift-dependent main sequence suggests 
that star forming molecular clouds in main sequence 
galaxies have similar physical conditions at all redshifts 
z<2.5, with only their number and filling factor chang- 
ing along the main sequence and between redshifts. On 
the main sequence, log(vLv(8)/LIR)=-0.58 when the 8 
yum flux is measured through the MIPS 24 yum filter 
at z=2. For higher SSFR above the main sequence, 
vLv(8)/LIR decreases with increasing ASSFRms . This 
can be interpreted as more compact star formation, 
as suggested by Elbaz et al. (2011), in a way consis- 
tent with scaling trends observed for other interstellar 
medium emissions. 

• The majority of X-ray AGNs in our sample have mid- 
infrared emission which is completely dominated by 
the star formation related emission (mainly PAHs). 
This is also seen in a stacked IRS spectrum of 4 X- 
ray AGNs. Only another, the brightest AGN, shows 
a significant mid-infrared excess (factor ~3.6) both in 
photometry and the IRS spectrum, with respect to the 
(corrected) template of a star forming galaxy. We de- 
rive a condition on the AGN luminosity and LIR in or- 
der for the AGN to contribute an even or greater 8 yum 
emission than the star formation. The required AGN 
luminosities for massive z~2 main sequence galaxies 
with log(LIR/Lo)~12 are high (Lagn > lO'^^ '* erg s"' ), 
which makes galaxies with a true 8 yum excess due to the 
AGN component rare. The fraction of AGNs dominat- 
ing the 8 yum emission is strongly dependent on the se- 
lection and is likely to increase for log(LIR/L0)<l 1.3. 
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(Italy); lAC (Spain). This development has been supported by 
the funding agencies BMVIT (Austria), ESA-PRODEX (Bel- 
gium), CEA/CNES (France), DLR (Germany), ASI (Italy), 
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APPENDIX 
A. DETERMINING LIR 

The first and most important piece of information we need for each galaxy is its 8-1000 fim LIR. As discussed in Section[3] 
the observed 24 fim flux is problematic as an LIR estimator for z~2 galaxies and its deviation from the local templates is one of 
the main issues we wish to investigate. We therefore exclude it from the LIR determination which we base exclusively on the 
energetically dominant rest-frame far-infrared data. This is done also to avoid subtle bias by which, depending on the specific 
adopted weighting scheme, 24 jim fluxes might aff'ect the LIR from template fits to combined mid- and far-infrared data. Only a 
small number of very luminous galaxies are detected at 70 fim and only GOODS-S was observed at this wavelength with PACS, 
which leaves us with PACS 160 jum and possibly PACS 100 fim to constrain both shape and scale of the SEDs. 
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Fig. 14. — yL,,/LIR for various templates of relevant luminosities from the CEOl SED library. Vertical dashed lines indicate the rest-frame wavelengths observed 
at 160 fim for z=1.5 and 2.5. 



Our basic strategy is to derive LIR from fitting CEOl templates to PACS photometry points close to rest frame 60 jum. For 
reasons described below, at z~2 we use only the observed 160 jum and fit only the scale, maintaining the CEOl LIR-SED shape 
relation. At z~l we use both 100 and 160 fim and fit both SED shape and scale. We sometimes designate the PACS-based LIR at 
z~ 2 as L1R(160) for a clear distinction from a MlPS-based L1R(24), but refer to 8-1000 yum rest wavelength in all cases. 

For the higher redshift z~2 sources, detection rates are limited in the 100 and 70 /urn bands compared to 160 fj.m for most 
luminosity bins. Selecting only the sources with two far-infrared points will bias our sample to the galaxies with bluer far- 
infrared colors and limit the ability to use stacking to compensate for the non-detections. 

Fortunately, with Herschel-PACS we are measuring LIR close to the far-infrared peak if using PACS 160 fim at z~2. The 
determination of LIR is thus much less sensitive to the exact shape of the assumed SED. The flux density at rest frame wavelength 
of 60 iim is a particularly good measurement for a monochromatic luminosity deter mination as w e shall explain below. 

Elbaz et al. (2010) demonstrated that for redshifts 1 < z < 3, fitting the Chary & Elbaz (2001) SED library to a single PACS 
160 jum measurement results in an excellent agreement with the LIR measured from combined Herschel PACS+SPIRE photome- 
try (sampling the full far-infrared SED peak). The robustness of LIR determined from monochromatic 160 /vm luminosity can be 
understood by looking at the templates normalized to their total IR luminosity. The conversion factor from the observed flux ¥y{A) 
to the total IR luminosity is vLy(/l)/LlR. Several templates from the CEOl SED library are plotted in Figure [14] in vLv(/l)/LIR 
scale. The templates for the relevant luminosities of L1R=10"-10''' L© cross each other between 50-60 //m, where the 160 /vm 
filter samples the SED at 1.5<z<2.5 (dashed vertical lines). Of course, at low enough luminosities such as log(LlR/LQ)=10.5 
(~3.5 Mq yr"') the template deviates for the typical vLv/LlR at 60 jum. However, our typical luminosities are more than an order 
of magnitude higher than that and such cold SEDs will not be common. 

Nearly identical vLy(/l)/LlR values for the different templates at a given wavelength mean that for the redshifts at which the 
filter is centered on this wavelength, LIR can be measured regardless of the specific template that is selected. Possible systematic 
errors due to selecting a template different from the optimal one are very small and can be visually seen in Figure [14] as the 
distance between the SED lines (up to 0. 1 dex around 60 //m). This fortunate coincidence around 60 /im will graduall y worsen if 
the dust temperatures get significantly lower than the local LIR-T relation implemented in the CEOl library. Hwang et al.l (120101) 
studied deep Herschel data from the far-infrared to sub- milUmeter and rep ort that this is not the case and this result agrees with 
the accuracy of LI R measured from 160 jum as found bv lElbaz et al.l (12010) . Further support is given to this by the deeper data of 
lElbaz et al.l (l2011h . We therefore measure LIR for z~2 galaxies by fitting CEOl templates to 160 jum fluxes alone, even in cases 
where shorter wavelength fluxes are available. 

The reason not to use PACS 70 and 100 jum even when available is twofold: One is that the fairly low detection rates at these 
wavelengths. If we use the added data points per galaxy to relax the luminosity-template relation in the library it means that we 
will be treating the part of the population which is detected at 100 /im differently than the non-detections. The other is that the 
z~2 70 and 100 yum bands observe ~25 and 35 yum rest, far from the SED peak and a region of the SED which is not part of the 
modified blackbody emission and for which the vLv(/l)/LlR values can vary significantly between templates (Figure [T4b. Using 
this SED region to constrain LIR means that we allow a measurement that represents a small fraction of the total luminosity to 



Evolving infrared SEDs and star formation rate estimates 



23 




11.4 11.6 11.8 12.0 12.2 12.4 12.6 12.8 

log LIR(160 /* 



Fig. 15. — Comparison of total infrared luminosity LIR(all PACS bands) derived from all available PACS bands (at least two detections), versus total infrared 
luminosity LIR(160) from 160 /jm alone. LIR(all PACS bands) is determined by fitting all available PACS bands with free scale and template shapes in the CEOl 
library. LIR(160), used throughout the paper, is determined by fitting 160 /im alone, maintaining the original luminosity-template relation in the library. We 
select all 1.5<z<2.5 galaxies in our sample that have two or three PACS fluxes (45 and 15 out of 128). The statistics for Alog(LIR) = log(LlR(all PACS bands)) - 
log(LIR(160)) are given in the figure. Errorbars represent formal random errors only, propagated from the photometric errors. 



determine LIR. This can lead to systematic errors if the selected template is inaccurate and even larger errors if we do not thaw 
the luminosity-template relation in the library. The combination of these two effects makes the potential biases and systematic 
errors outweigh the gain in added constraints to the LIR determination. 

In any case, the difference when including or excluding the shorter wavelengths is quite small as we demonstrate in FigurefTsl 
In our z~2 sample, considering only sources that have 160 fim and at least one other PACS band detections (half of our full z~2 
sample), we fit all available PACS bands with CEOl templates (free template and scales, minimization) and compare with the 
LIR derived by a monochromatic 160 fim fit. The median difference and median absolute deviation (MAD) between the two 
methods are 0.02 and 0.03 dex and the maximum difference is 0.28 dex. The added (and possibly biased) constraint provided 
by the 100 and 70 fim points is used for selecting a slightly different template, that still results in nearly the same LIR which 
is determined by the 160 yum flux almost independently (see Figure fT4li. Only 5 of 60 galaxies deviate in their monochromatic 
LIR(160) from the multi-band LIR(PACS) by more than 1-cr, the largest of which is 1.5 cr. One should keep in mind that due 
to the detection limits, many galaxies with real 100/160 fim flux ratios that are lower than our sample values will simply not be 
detected in 100 fim and are not represented in this comparison. 

Concerning our lower redshift z~l bin, the far-infrared rest-frame wavelengths scanned by the three PACS photometers band 
for 0.7 < z < 1.3 spans across 20-95 fim, overlapping in large part the range covered for 1 .5 < z < 2.5 (20-65 fim). At these 
lower redshifts, the PACS 160 fim filter has moved farther away from the rest frame 60 fim and PACS 100 fim filter has moved 
closer. The lowest luminosities in the lower redshift sample are of log(LIR/L0)x!lO.8, which may include colder SEDs than those 
for which the 60 fim derived LIR is ideal. By coincidence, for the lower luminosities of the z~l sample (log(LIR/L0)<12), the 
templates cross at rest 80-90 fim (Figure O, which is again the rest wavelength of the 160 fim filter At these redshifts 100 fim 
has a much higher detection rate (100% of the 160 fim sources in most bins we use) and probes a much more useful rest-frame 
wavelength (~50 fim) than it does at z~2. The arguments for using only 160 fim mentioned above do not apply to this sample. 
For LIR determination at z~l, we fit the best combination of CEOl template shape and scale (as two free parameters) to the 
100 and 160 fim points simultaneously. For a few cases at the lowest luminosities where only 160 fim is available and then a 
monochromatic fit to LIR is used. 

B. STACKING ANALYSIS 
Measuring the stacked flux 

Sources which are undetected in 70 or 100 fim (but detected at 160 fim) are stacked to produce a mean photometric point 
representing this population. Stacking is performed into a residual map from which all individually detected sources have been 
removed. This is done in order to minimize the contamination from much brighter nearby sources. The viable depth of stacks is 
limited by the density of neighboring sources which are brighter than the typical sources in the stack. In this work we stack into 
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the 70 and 100 //m images on the position of MIPS 24 fim sources, which are detected at 160 yum. This means that the brightness 
of our stacked sources is likely just below the detection threshold, or they will not have been detected at 160 /im. By removing 
the individual detections, our sources of interest are among the brightest sources that are still left in the image. 

The background level for each stacked image is estimated from the position of the peak in the pixel value distribution P(D). This 
method is more reliable than measuring the background from the stacked image itself. Due to selection effects, prior positions 
will tend to be at a minimum distance from the neighboring sources, thus the priors tend to be positioned at local minima in the 
map, which is reflected as a depressed background-level at the center of the stacked image. Background estimated from the edges 
of the stacked image will tend to be higher than the background at the position of the stacked source. Subtracting the background 
according to the PiD) peak and extracting the flux using a limited-radius point-spread function avoids this issue. 

Stacked vLy/LIR 

The value of interest for the stack is vLv(/lo)/LlR, at a certain rest frame wavelength Aq. When converting from stacked flux 
to vLv,(/lo)/LIR one must account for the different redshifts of the stacked sources (affecting both the luminosity-distance and the 
rest wavelength), as well as for the different (known) individual LIR. We use the following weighted mean to get the stack mean 
vLv/LIR: 

^0 = ^filter Kl+Z) (BI) 
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where A/ut is the filter central wavelength, Aq is the adopted rest-frame wavelength for the stacked flux, z and z, are the mean and 
source-j redshifts, Nstack is the number of sources in the stack, Fy the stacked observed flux and Z), the luminosity-distance for 
each source, a is the approximated slope of the SED (vLy oc A") at Ao, which is used to correct the fluxes to a common A^. The 
exact value of a makes only small difference in the limited redshift intervals we use - we assume or « 1.5 close to the slope of 
LIRG templates in CEO I, which is justified posteriorly. 

Uncertainty on the stacked flux 

There are two kinds of uncertainties regarding the stacked flux as an estimator to the mean flux of the population: one is 
photometric, i.e. the significance of the detection of a source in the stacked image. The other is on the stacked flux as the 
estimator to the true mean flux of the stacked population as derived from a finite size sample. The latter depends on the flux 
distribution within the individual sources in the stack. 

In order to estimate the photometric uncertainty a-ph,,, we create a large number of additional stacks, each using random 
positions in the image, where A'' is the number of stacked sources in the real stack. Each random stack is treated in the same way 
the real stack and a source flux is measured. The standard deviation in the fluxes measured from the random stacks is taken as the 
absolute flux uncertainty. This determines the stacked-source detection significance, i.e. the probability to produce such a flux 
measurement from a completely meaningless stack. For stacks without a detected source (flux< So-,,;,,,,) a flux of 3crpi,o, is taken 
as the upper limit. 

The uncertainty on the stacked flux as an estimator to the mean source flux of the stacked population is estimated using a boot- 
strap method: From the list of priors in the stack a new list, identical in size is created by random re-sampling with replacements. 
We perform a large number of these re-samplings and produce a distribution of measured fluxes. A 68% confidence interval is 
calculated from this distribution. The bootstrap error on the mean stacked flux of large stacks already includes photometric flux 
errors. However, on a small stacked sample they might be underestimated. Therefore the larger of the two is taken as the final 
uncertainty. 

C. THE MEAN-SED REDSHIFT-SCAN METHOD: MINIMIZATION 

When fitting an SED template to the collection of photometric points from the subsample and applying ax^ minimization, one 
must take into account that these measurements are not repeated measurements of the same source. The measurements scatter not 
only due to the errors related to the photometry of each measurement CTphot, but also due to the intrinsic scatter in the population 
o'pop- Neglecting to account for this will result in a biased fit which is dominated by the few points with the best signal to noise; 
these would be the galaxies with the bluer far-infrared colors that are better detected at 70 and 100 jum than the redder ones. CTpop 
the intrinsic scatter in vLy{Ao)/LlR of the population must be estimated and included in the standard deviation for each data point 
when calculating the;^^^ to be minimized: 
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The inclusion of CTpgp has an important implication: for cases in which cr,,,,^ > cTphot, measurements of different sources in a 
given filter have nearly the same weight regardless of the photometric error. We must emphasize again that this is the appropriate 
weighting when doing a repeated sampling of a population rather then repeated measurements of the same object. 

When estimating the intrinsic vLy/LlR scatter in a population of galaxies, two things need to be taken into account: one is that 
the photometric points from different sources are for different rest frame wavelengths and need to be k-corrected to a common 
wavelength. The other is that for most cases, a significant fraction of the sources are not detected individually and are included 
as a stacked mean instead. 

When the significant majority of the sources (more than 80%) are detected by the given filter, we can ignore the small fraction 
of undetected sources and estimate the scatter from the detections only. For this purpose we correct all vLy/LIR of the same filter 
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to a common rest-frame wavelength (assuming a vLy oc A^-^ approximated slope) and calculate the standard deviation in vLy/LIR. 
When binning the galaxies by deviation from the main sequence, cTpop is also estimated from the detections only: In a selection 
by constant SSFR, the fainter, non-detected sources tend to be those of galaxies of lower absolute luminosity, not necessarily of 
lower vLv/LIR. Therefore, detections and non-detections are mixed in vLv/LIR of the far-infrared filters. While a weak trend with 
luminosity does exist, it is a secondary effect for cr^o,, estimation. 

When selecting by LIR and a significant fraction (over 20%) of the population is individually undetected in the given filter, 
the scatter is unresolved in its lower part and the weighted mean of the distribution is yet to be determined. In order to estimate 
the standard deviation of the full population, we approximate the distribution to be normal. In this case we can use the fact that 
in a narrow LIR bin the detected and non-detected source populations are fairly well separated by the detection threshold. For 
the stacked sources we only know their mean vLv(/lo)/LIR and we calculate the same value for the detections using equation lB2l 
We then find the standard deviation of a normal distribution, that when split into two sub-populations at a value x^, the sizes of 
the two sub-populations {x < Xc and x > Xc) have the same ratio as N detect IN stack and the difference between their means match 
the difference in the means of the detections and non-detections {^detect - l-i stack)- This problem has a simple and easy numerical 
solution. 



In this appendix we describe how to apply the calibration derived in Section|5]to a template library, expressed as a function of 
vLv(8)/LIR or LIR, which can then be fitted to a galaxy with a known mass and a redshift. The additional implicit information 
we need is the redshift- dependent main sequence SFR(M,,z)ms, that was defined in Section|4]and is based on the calibration of 
iRodighiero et al.l i20lQh. The distance from the main sequence is then A log(SSFR)MS = log(SFR/M,) - log(SFR(M,, z)ms/A^.)- 
We can use Equation |3] or |4] to eliminate Alog(SSFR)MS and find a direct association between a template defining property 
(vLv(8)/LIR or Aceoi) and the LIR it should be scaled to. We will proceed with the specific example of the CEOl library and 
define as the ratio LIR/SFR (Equation[Tli: 



Thus, we scale each of the log(AcEOi/Lo)>10.38 templates in the original library to the new LIR, simply by applying a factor 
LIR/ AcEOi to the template. The log(AcEOi/Lo)<10.38 templates in the library either have vLv(8)/LIR>-0.58 or quickly become 
degenerate in this value. Our calibration does not deal with these low luminosity templates, that tend to represent galaxies well 
below the current IR detection limits at the redshifts of interest. 



D. CALIBRATING A TEMPLATE LIBRARY BY ALOG(SSFR)ms 




log(AcE()l/^0)- 10.58 
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